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1.  INTRODUCTION 


-"This  report  describes  research  conducted  in  the  Cognitive  Psychophysiology. 
Laboratory  during  the  period  1/1/87-12/31/87  with  the  support,  or  partial 
support,  of  the  present  contract. 

As  in  previous  years,  we  have  continued  to  pursue  several  closely  related 

A 

goals.  Our  primary  mission  has  been  to  develop  an  understanding  of  the  Event- 
Related  Brain  Potential  (ERP)  so  that  it  can  be  used  in  the  study  of  human 
cognitive  function  and  in  the  assessment  of  man-machine  interactions.  To  this 
end,  we  have  conducted  research  in  the  following  areas:- 

a.  The  use  of  ERPs  in  the  study  of  attention  and  skill  acquisition 

b.  The  use  of  ERPs  in  the  study  of  mental  chronometry 

c.  The  use  of  ERPs  in  the  study  of  mental  resources  and  workload 

d.  The  use  of  ERPs  in  the  study  of  memory 

e.  The  development  of  an  animal  model  of  the  P300  component 

f.  The  use  of  ERPs  as  a  communication  channel 

g.  Miscellaneous  studies 

Listed  in  Section  3  are  all  chapters,  papers,  abstracts  and  presentations 
that  were  published,  submitted,  or  "in  preparation"  in  1987.  Since  this  is  the 
last  year  of  the  current  contract,  we  have  also  listed  publications  (Sections  4 
and  5)  that  report  the  results  of  all  research  supported  by  the  contract  in 
previous  years  (1/1/85-12/31/86). 

Full  reports  of  the  studies  for  the  current  year  are  included  in  the 
Appendix.  In  the  following  sections,  we  provide  e,  brief  description  of  the 
research.  Where  appropriate,  we  have  referred  to  the  appropriate  appendix 
items. 


Page  1 


2.  DESCRIPTION  OF  THE  RESEARCH  (1/1/87-12/31/87) 


Numbers  in  parentheses  refer  to  full  reports  of  the  research  listed  in 
Section  3  and  provided  in  the  Appendix. 

2.1  Attention  and  Skill  Acquisition 

The  focus  of  this  research  has  been  on  the  use  of  ERPs  to  provide 
converging  evidence  for  the  chronometric  and  energetical  changes  that  take  place 
during  the  development  of  highly  skilled  behaviors.  To  this  end,  we  have 
recorded  ERPs  along  with  a  number  of  traditional  performance  measures  such  as 
RMS  tracking  error,  RT  and  accuracy  in  paradigms  that  allow  for  the  development 
of  "automatic"  processing  (12,  25).  Several  important  results  have  been 
obtained  from  this  research.  First,  we  have  preliminary  evidence  which  argues 
that  there  are  a  number  of  dissociable  automatic  processes  that  develop  at 
different  rates.  Second,  the  results  of  dual-task  manipulations  indicate  that 
even  tasks  which  possess  "automatic"  processes  are  susceptible  to  interference 
in  a  manner  predicted  by  multiple  resource  models.  Third,  the  use  of  P300 
amplitude  in  conjunction  with  the  manipulation  of  multi-task  processing 
priorities  has  allowed  us  to  map  the  attentional  requirements  of  automatic  and 
non-automatic  processes. 

2.2  Mental  Chronometry 

The  focus  of  this  research  has  been  on  the  use  of  ERPs  to  measure  the 
timecourse  of  mental  processes.  A  related  aim  has  been  to  use  ERPs  to  examine 
the  information  processing  mechanisms  responsible  for  variability  in  measures  of 
overt  behavior.  In  our  most  recent  research,  we  have  used  measures  of  the 
readiness  potential,  recorded  from  lateral  electrode  sites  above  the  motor 
cortices,  to  assess  response  preparation.  These  measures,  as  well  as  those  of 
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the  electromyogram  (EMG),  suggest  that  information  transmission  among 
processing  elements  is  accomplished  continuously  -  or  at  least  in  several  steps 
(1  and  8). 

A  second  line  of  research  has  focussed  on  measures  of  the  readiness 
potential  as  predictors  of  response  latency  and  accuracy.  In  contrast  to  the 
claims  of  Gevins  et  al . ,  we  have  shown  that  this  ERP  measure  can  be  used  to 
predict,  in  advance  of  an  overt  response,  whether  the  response  will  be  accurate 
and  how  fast  it  will  be  (8).  This  finding  clearly  reveals  the  utility  of  the 
psychophysiological  approach  in  accounting  for  variation  in  overt  behavior. 

A  further  line  of  research  examines  both  the  utility  of  P300  as  an  index  of 
shifts  in  subject  strategy  (6)  and  as  a  marker  for  the  duration  of  stimulus 
evaluation  processes  (22). 

2.3  Mental  Resources  and  Workload 

This  research  has  focused  upon  the  use  of  ERPs  as  metrics  of  mental 
workload  and  resource  allocation  in  multi-task  environments  (for  reviews  see  9, 
15).  Previous  research  conducted  in  our  laboratory  has  indicated  that  P300 
amplitude  behaves  in  a  manner  predicted  by  multiple  resource  models.  Thus,  the 
amplitude  of  P300  appears  to  mimic  the  predicted  resource  tradeoffs  as  a 
function  of  task  difficulty  and  processing  priority.  In  recent  research,  we 
have  determined  that  (a)  the  reciprocity  in  P300  amplitudes  elicited  by  two 
concurrently  performed  tasks  is  predictive  of  single  subject  performance  over  a 
relatively  wide  range  of  task  variables  (20),  (b)  the  P300  effect  can  be 
generalized  to  non-laboratory  tasks  such  as  instrument  flight  (10),  (c)  and  that 
the  visual  N190  as  well  as  the  P300  components  reflect  resource  tradeoffs  both 
within  and  across  tasks  while  the  N160  is  sensitive  to  only  within  task  resource 
al locat ion  (11). 
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2.4  Memory 

We  demonstrated  earlier  that  there  is  a  relationship  between  the  P300 
response  to  a  stimulus  and  the  subsequent  memorability  of  that  stimulus.  This 
relationship  was  observed  in  subjects  who  used  a  rote  memorization  strategy,  but 
not  in  those  who  used  elaborative  strategies.  In  a  partial  replication  of  this 
experiment,  we  demonstrated  that  the  same  effects  can  be  observed  within  the 
same  subjects  (16).  Furthermore,  the  data  indicate  how  the  P300  measure  can  be 
used  to  evaluate  theories  of  memory  that  emphasize  distinctiveness  as  a  critical 
attribute  in  the  memorability  of  events. 

2.5  Animal  Models 

This  research,  which  has  been  referred  to  in  previous  reports,  demonstrates 
the  presence  of  probability-sensitive  neuronal  activity  in  rabbits  (21).  This 
finding  has  implications  for  the  search  for  the  neural  source  of  the  P300 , 
since  probability  sensitivity  is  one  of  the  critical  features  of  the  P300. 

2.6  ERPs  and  Communication 

We  have  continued  our  work  on  the  use  of  ERPs  in  communication.  We  have 
demonstrated  that  it  is  possible  to  determine,  by  evaluating  of  the  P300 
response  to  a  display,  which  element  in  the  display  the  subject  has  selected. 
When  these  elements  are  letters  of  the  alphabet,  the  procedure  enables  subjects 
to  use  their  ERPs  to  communicate  (17). 

2.7  Miscellaneous 

In  this  category,  we  include  studies  of  methodological  issues,  theoretical 
articles,  and  review  papers. 
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We  have  continued  our  efforts  to  devise  a  metric  for  the  assessment  of 
topographic  information.  Without  exception,  an  important  defining  characteris¬ 
tic  of  an  ERP  component  is  its  scalp  distribution.  Furthermore,  it  is  clear 
that  the  distribution  of  scalp-recorded  brain  potentials  nay  provide  critical 
clues  concerning  their  neural  source.  However,  until  this  time,  no  satisfactory 
methods  have  been  proposed  for  the  quantification  of  scalp  distribution.  A 
paper  describing  the  current  version  of  our  procedure  (18)  and  a  validation 
study  (19)  are  currently  under  final  editorial  review.  This  procedure  provides 
the  investigator  with  the  opportunity  of  dealing  with  the  problem  of  overlapping 
components,  and  of  distinguishing  statistically  between  different  scalp 
distributions. 

A  second  methodological  contribution  has  been  to  extend  the  method  of 
correction  for  eye-movement  artifact  to  include  both  horizontal  and  vertical 
movements  and  to  provide  a  program  for  the  general  public  to  implement  the 
correction  procedure  (14). 

We  have  published  two  theoretical  articles  this  year.  One  deals  with 
general  issues  concerning  the  role  of  theory  in  cognitive  psychophysiology  (3), 
the  other  examines  in  detail  the  context-updating  model  of  the  P300  (5).  A 
third  article  deals  with  issues  in  the  definition  and  reliability  of  measures  of 
the  P300  (7).  In  addition,  we  have  published  several  chapters  dealing  with  the 
utility  of  measures  of  the  ERP  in  the  study  of  human  factors  (9  and  15)  and 
psychological  issues  (2),  with  the  utility  of  measures  of  the  cardiovascular 
system  in  human  factors  research  (4),  and  with  the  analysis  of  aging  using  the 
psychophys iolog ica  1  approach  (13). 
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3.  PUBLICATIONS  FOR  THE  CURRENT  YEAR  (1/1/87-12/31/87) 


Note  that  full  reports  of  those  entries  preceded  by  a  number  are  provided 

in  the  appendix. 

3.1  Papers  and  Chapters 

1.  Coles,  M.  G.  H.,  Gratton,  G. ,  &  Donchin,  E.  (in  press).  Detecting  early 
communication:  Using  measures  of  movement-related  potentials  to  illuminate 
human  information  processing.  Biological  Psychology. 

2.  Coles,  M.  G.  H.,  Gratton,  C.,  &  Fabiani,  M.  (in  press).  Event-related 
brain  potentials.  To  appear  in  Cacioppo,  J.  T.  &  Tassinary,  L.  G. ,  (Eds.) 
Principles  of  Psychophysiology:  Physical,  Social,  and  Inferential  Elements. 
Cambridge:  Cambridge  University  Press. 

3.  Coles,  M.  G.  H.,  Gratton,  G.,  &  Gehring,  W.  J.  (1987).  Theory  in  cognitive 
psychophysiology.  Journal  of  Psychophysiology,  1,  13-16. 

4.  Coles,  M.  G.  H. ,  &  Sirevaag,  E.  (1987).  Heart  rate  and  Sinus  Arrhythmia. 

In  A.  Gale,  &  B.  Christie  (Eds.),  Psychophysiology  and  the  electronic 
workplace.  London:  John  Wiley  &  Sons. 

5.  Donchin,  E.,  &  Coles,  M.  G.  H.  (in  press).  Is  the  P300  component  a 
manifestation  of  context  updating?  The  Behavioral  and  Brain  Sciences. 

6.  Donchin,  E.,  Grafton,  G.,  Dupree,  D.  ,  &  Coles,  M.  G.  H.  (in  press).  After 
a  rash  action:  Latency  and  amplitude  of  the  P300  following  fast  guesses.  In 
G.  Galbraith,  M.  Klietzman,  &  E.  Donchin  (Eds.)  Neurophysiology  and 
Psychophysiology:  Experimental  and  Clinical  Applications.  Hillsdale,  NJ : 

Er lbaum. 

7.  Fabiani,  M. ,  Gratton,  G.,  Karis,  D. ,  &  Donchin,  E.  (in  press).  Definition, 
identification,  and  reliability  of  measurement  of  the  P300  component  of  the 
event-related  brain  potential.  In  P.  K.  Ackles,  J.  R.  Jennings,  &  M.  G.  H. 
Coles  (Eds.)  Advances  in  Psychophysiology,  Volume  2.  Greenwich,  CT :  JAI 
Press,  Inc.,  pp.  1-78. 

8.  Gratton,  G. ,  Coles,  M.  G.  H.,  Sirevaag,  E.,  Eriksen,  C.  W. ,  &  Donchin  E. 

(in  press).  Pre-  and  post-stimulus  activation  of  response  channels:  A 
psychophys iolog ical  analysis.  Journal  of  Experimental  Psychology:  Human 
Perception  and  Performance. 

9.  Kramer,  A.F.  Event-related  brain  potentials.  ('987).  In  A.  Gale  and 

B.  Christie  (Eds.),  Psychophysiology  and  the  Electronic  Workplace.  Sussex, 
England:  John  Wiley  and  Sons. 

10.  Kramer,  A.,  Sirevaag,  E. ,  &  Braune,  R.  (1987).  A  psychophysiological 
assessment  of  operator  workload  during  simulated  flight  missions.  Human 
Factors ,  29(2),  p.  145-160. 

11.  Kramer,  A.,  Sirevaag,  E.,  &  Hughes,  P.  (in  press).  Effects  of  foveal  task 
load  on  visual-spatial  event-related  brain  potentials  and  performance. 
Psychophysiology. 
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12.  Kramer,  A.  F. ,  &  Strayer,  D.  L.  (in  press).  Assessing  the  development  of 
automatic  processing:  An  application  of  dual-task  and  event-related  brain 
potential  methodologies.  Biological  Psychology. 

13.  Miller,  C.  A.,  Bashore,  T.  R. ,  Farwell,  L.  A.,  &  Donchin,  E.  (1987). 
Geriatric  psychophysiology.  In  K.W.  Schaie  (Ed.),  Annual  Review  of 
Gerontology  and  Geriatrics,  Volume  7  (pp.  1-27).  New  York:  Springer- 
V  e  r  1  ag  . 

14.  Miller,  G.  A.,  Cratton,  G. ,  &  Yee,  C.  M.  (in  press).  Generalized 
implementation  of  an  eye  movement  correction  procedure.  Psychophysiology. 

15.  Wickens,  C.  W.  (in  press).  Application  of  ERPs  to  human  factors.  In 
J.  Rohrbaugh,  R.  Johnson,  and  R.  Parasuraman  (Eds.),  Event-r el  a t  ed 
potentials  and  the  brain.  New  York:  Oxford  University  Press. 

3.2  Articles  Submitted  for  Publication 

16.  Fabiani,  M. ,  Karis,  D.  ,  &  Donchin,  E.  (  1987  ).  Effects  of  strategy 
manipulation  in  a  von  Restorff  paradigm.  Submitted  for  publication. 

17.  Farwell,  L.  A.  &  Donchin,  E.  (submitted).  Talking  off  the  top  of  your 

head:  A  mental  prosthesis  utilizing  event-related  brain  potentials. 

Electroencephalography  and  Clinical  Neurophysiology. 

18.  Gratton,  G.,  Coles,  M.  G.  H. ,  &  Donchin,  E.  (1987).  A  procedure  for  using 

multi-electrode  information  in  the  analysis  of  components  of  event-related 
potentials:  Vector  Filter.  Psychophysiology.  Submitted  for  publication. 

19.  Gratton,  G.,  Kramer,  A.  F.,  Coles,  M.  G.  H. ,  &  Donchin,  E.  (1987).  A 
simulation  study  of  the  latency  measures  of  components  of  event-related 
brain  potentials.  Psychophysiology.  Submitted  for  publication. 

20.  Sirevaag,  E. ,  Kramer,  A.,  Coles,  M.  G.  H. ,  &  Donchin.  E.  (1987).  Resource 

reciprocity:  An  event-related  brain  potentials  analysis.  Act  a 

Psychologica.  Submitted  for  publication. 

21.  Stolar,  N. ,  Sparenborg ,  S.  ,  Donchin,  E. ,  &  Gabriel,  M.  (  1987  ).  An  animal 
model  for  the  P300  component  of  the  event-related  potential  in  humans. 
Behavioral  Neuroscience.  Submitted  for  publication. 

3.3  Abstracts 

22.  Jenkins,  S.  ,  Gratton,  G.,  Coles,  M.  G.  H.,  &  Donchin,  E.  (  1987).  P300 

latency  and  task  requirements  (Abstract).  Psychophysiology,  24 ,  594. 

23.  Sirevaag,  E. ,  &  Kramer,  A.  (1987).  N100  and  P300  tuning  effects  during  an 
attention  switching  task  (Abstract).  Neuroscience,  1  7  ,  654. 
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3.4  Presentations 

Coles,  M.  G.  H.,  Gratton,  G.,  &  Donchin,  E.  (1987).  Changing  minds:  Using 
measures  of  movement  related  potentials  to  illuminate  evaluation  processes. 
Presented  at  The  IV  International  Conference  on  Cognitive  Neuroscience, 
Dourdan,  France. 


24.  Gehring,  W.  J.,  Strayer,  D.  L. ,  Kramer,  A.,  Donchin,  E.,  &  Miller,  G. 
(1987).  An  evaluation  of  age  differences  in  the  development  of 
automat icity.  Proceedings  of  The  IV  International  Conference  on  Cognitive 
Neuroscience,  Paris-Dourdan,  France. 


Gratton,  G.  (1987).  The  use  of  scalp 
of  the  Event-Related  Brain  Potential. 
Conference  on  Cognitive  Neuroscience, 


distribution  to  separate  components 
Presented  at  The  IV  International 
Dourdan,  France. 


Gratton,  G.  &  Coles,  M.  G.  H.  (1987).  Detecting  early  communication:  Can 
response  preparation  begin  before  stimulus  evaluation  ends?  Presented  at 
the  Annual  Hoosier  Mental  Life  Meeting,  Monticello,  IL. 


Gratton,  G.,  &  Coles,  M.  G.  H.  (1987).  Generalization  and  evaluation  of 
eye-movement  correction  procedures.  Presented  at  the  Workshop  on  Removing 
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Abstract 

In  this  paper  we  review  evidence  that  suggests  that  the  stimulus 
evaluation  system  can  pass  information  to  the  response  activation  system 
before  evaluation  is  completed  (“early  communication").  This  evidence  is 
derived  from  measures  of  the  aralized  readiness  potential,  which  have 
been  related  in  previous  research  to  the  preparation  for  movement.  Early 
communication  is  evident  in  conflict  and  congruence  paradigms.  In  both 
paradigms,  a  single  stimulus,  or  two  different  stimuli,  deliver  two  aspects 
of  information.  In  the  conflict  paradigm,  the  first  aspect  of  information 
(derived  from  preliminary  evaluation)  primes  the  incorrect  response,  while 
the  second  primes  the  correct  response.  In  the  congruence  paradigm, 
information  derived  from  preliminary  and  complete  evaluation  is  congruent. 

In  both  paradigms,  lateral ized  readiness  potential  measures  suggest  that 
preliminary  evaluation  is  able  to  prime  the  response  system,  although  the 
overt  motor  response  may  not  be  released  until  evaluation  is  completed. 

This  demonstration  of  early  communication  has  both  theoretical  and  practical 
implications.  First,  it  does  not  support  single-decision  models  of 
information  processing.  Second,  it  suggests  that  the  lateralized  readiness 
potential,  a  continuous,  analog  measure  of  the  activity  of  the  response 
system,  can  be  used  to  make  inferences  about  the  nature  of  the  evaluation 
process,  and  to  localize  the  effects  of  various  manipulations  on  the 
information  processing  system. 
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Detecting  early  communication:  Using  measures  of  movement-related 
potentials  to  illuminate  human  information  processing 

Michael  G.  H.  Coles,  Gabriele  Gratton,  &  Emanuel  Donchin 
1.  Introduction 

A  central  issue  in  contemporary  research  on  human  information 
processing  concerns  the  nature  of  the  transmission  of  information  between 
elementary  information  processing  activities  (e.g.,  Meyer,  Yantis,  Osman,  & 
Smith,  1986,  and  Meyer,  Osman,  Irwin,  &  Yantis,  this  volume).  Two  classes 
of  models  can  be  identified:  discrete  models,  that  assume  that  information 
is  transmitted  discretely,  only  when  processing  at  a  particular  level  of  the 
system  (or  stage)  is  completed  (e.g.,  Sanders,  1980;  Sternberg,  1969),  and 
continuous  models,  that  assume  that  information  is  transmitted  continuously, 
as  soon  as  it  is  available  (e.g.,  Eriksen  &  Shultz,  1979;  Grice,  Nullmeyer, 

&  Spiker,  1982;  McClelland,  1979).  A  hybrid  model  has  been  proposed  by 
Miller  (1982),  the  asynchronous  discrete  coding  model .  Like  discrete 
models,  this  model  assumes  that  information  is  transmitted  discretely,  but 
like  continuous  models,  it  assumes  that  information  may  be  transmitted 
before  processing  at  a  particular  level  is  completed.  Thus,  information  is 
transmitted  in  "chunks,"  whose  size  and  number  may  vary  as  a  function  of  the 
nature  of  the  information. 

The  psychophysiological  evidence  we  review  ,in  this  paper  suggests  that 
communication  between  stimulus  evaluation  and  response  activation  systems 
can  take  place  continuously,  or  at  least  in  chunks.  In  particular,  the 
activity  of  the  motor  system  can  be  influenced  by  preliminary  phases  of 
stimulus  evaluation. 
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1. 1  Early  Communication 

To  address  the  question  of  the  nature  of  communication,  it  is  assumed 
that  the  information  processing  system  can  be  represented  by  two  major 
subsystems:  a  stimulus  evaluation  system  responsible  for  the  identification 
of  stimulus  information,  and  a  response  activation  system  that  is  directly 
responsible  for  the  generation  of  overt  behavioral  responses.  This 
assumption,  which  is  prevalent  in  contemporary  theorizing  about  information 
processing  (see,  for  example,  Grice  et  al.,  1982;  Posner,  1978),  can  be 
traced  directly  to  the  work  of  Bonders  (1868/1969)  and  his  followers  (e.g. 
Sanders,  1980;  Sternberg,  1969).  A  further  assumption  is  that  each  of  these 
systems  .can  pass  through  intermediate,  states,  or  levels  of  activation, 
before  it  achieves  a  threshold  level  (Eriksen  &  Schultz,  1979;  McClelland, 
1979;  Meyer  et  al.,  1986;  Miller,  1982).  For  the  stimulus  evaluation 
system,  this  threshold  level  corresponds  to  "complete"  evaluation  of  the 
stimulus  (by  which  we  mean  that  sufficient  information  has  been  extracted 
from  the  stimulus  to  enable  the  subject  to  produce  the  correct  response). 
For  the  response  system,  the  threshold  level  corresponds  to  the  initiation 
of  the  response.  These  ideas  are  represented  by  the  activation  by  time 
functions  in  Figure  1. 


Insert  Figure  1  About  Here 

Given  these  assumptions,  the  question  of  communication  between  stimulus 
evaluation  and  response  systems  can  be  rephrased  as  follows:  can  the 
stimulus  evaluation  system  transmit  information  to  the  response  system 
before  evaluation  is  completed?  If  so,  then  the  response  system  should  be 
influenced  by  preliminary  phases  of  evaluation.  This  type  of  Influence  can 


Early  coronunicat ion 


Page  5 


be  referred  to  as  "early  communication."  Note  that,  in  Figure  1,  the 
presence  of  early  communication  is  associated  with  a  subthreshold  increase 
in  the  level  of  response  activation. 

2.  Measuring  Partial  Response  Activation 

The  response  activation  function  depicted  in  Figure  1  is,  of  course, 
hypothetical.  To  explore  this  function  and  detect  early  communication  we 
must  identify  procedures  that  can  be  used  to  reveal  the  presence  of 
subthreshold  response  activation.  One  approach  to  this  problem  is  to  use 
experimental  manipulations  which  allow  one  to  infer  the  presence  of  "early 
communication"  effects  from  measures  of  overt  behavior  (e.g.,  Miller,  1982; 
Proctor  &  Reeve,  1985;  Kounios,  Osman,  &  Meyer,  1987).  This  approach  may 
also  incorporate  the  analysis  of  reaction  time  distributions  (e.g.  Logan  & 
Cowan,  1984;  Meyer  et  al . ,  1986).  Another  approach  involves  the  use  of 
measures  of  peripheral  subthreshold  motor  activity  (the  electromyogram)  to 
infer  the  presence  of  partial  response  activation  (Coles,  Gratton,  Bashore, 
Eriksen,  &  Oonchin,  1985;  Eriksen,  Coles,  Morris,  &  O'Hara,  1985). 

In  this  paper,  however,  we  focus  on  event-related  brain  potential 
measures,  and  in  particular  on  the  lateral ized  negative  potentials  that  have 
been  found  by  many  investigators  to  precede  left-  and  right-hand  overt 
responses.  As  we  will  discuss  later,  these  measures  satisfy  several 
criteria  for  measures  of  response  activation.  First,  they  are  intimately 
related  to  the  the  activity  of  the  motor  system.  Second,  a  fixed 
relationship  can  be  observed  between  these  measures  and  the  emission  of  the 
overt  response.  Third,  they  are  affected  by  experimental  variables  that  are 
assumed  to  produce  response  activation  (e.g.,  priming,  bias,  etc.).  Fourth, 
they  are  related  to  the  parameters  of  the  subsequent  overt  response 
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(accuracy,  and  latency). 

These  lateral  iced  negativities  were  first  observed  in  studies  of 
voluntary  movements  (Kornhuber  &  Deeke,  1965).  When  subjects  anticipate 
making  a  response  with  a  particular  hand,  an  increase  in  negativity  occurs 
that  is  larger  at  scalp  sites  contralateral  to  the  responding  hand  (Kutas  & 
Oonchin,  1974;  Vaughan,  Costa,  &  Ritter,  1972). 

The  scalp  sites  at  which  these  potentials  are  maximal  are  those  over 
lateral  central  areas  (C3  and  C4)  or  over  adjacent  sites  (frequently 
described  as  C31  and  C4 ' ) .  Note  that  these  sites  are  in  close  proximity  to 
those  areas  of  the  brain  that  are  presumed  to  control  movement  (Penfield  8. 
Jasper,  1954).  Of  course,  proximity  to  a  particular  brain  area  $  no  — 
guarantee  that  an  electrode  is  sensitive  to  activity  of  that  brain  area. 
However,  in  this  case,  there  is  evidence  from  a  variety  of  studies,  some  of 
which  we  review  below,  to  relate  the  lateral ized  negative  scalp  potentials 
to  activity  in  motor  brain  areas. 

We  refer  to  these  negativities  as  the  "lateralized  readiness  potential" 
and  will  focus  in  this  paper  on  the  difference  in  potential  between  scalp 
sites  contralateral  and  ipsilateral  to  particular  responses. 

The  claim  that  this  potential  is  related  to  the  activity  of  the  motor 
system  is  supported  by  functional  and  neurophysiological  evidence.  First, 
as  we  noted  previously,  this  potential  precedes  movements.  Second, 
neurophysiological  data  suggest  that  the  source  of  the  potential  may  lie,  at 
least  in  part,  in  motor  regions  of  the  cortex  (e.g.,  Arezzo  &  Vaughan,  1975; 
Okada,  Williamson,  &  Kaufman,  1982).  There  are  also  strong  similarities 
between  the  activity  of  units  in  the  motor  cortex  and  the  potential  (Requin, 
1985).  For  these  reasons,  it  is  clear  that  the  potential  is  related  to 
motor  activity. 
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The  fixed  relationship  between  the  lateralization  of  the  readiness 
potential  and  the  production  of  the  motor  response  is  evident  in  data  from 
Gratton,  Coles,  Sirevaag,  Eriksen,  and  Donchin  (in  press).  In  this  study,  a 
fixed  level  of  lateralization  was  observed  at  the  moment  of  the  onset  of  EMG 
activity,  regardless  of  response  accuracy  or  latency. 

A  variety  of  studies  have  demonstrated  that  the  measure  is  influenced 
by  those  variables  that  are  assumed  to  produce  partial  response  activation. 
In  one  class  of  studies,  information  about  a  future  response  is  provided  by 
a  valid  precue.  Such  information  should  be  associated  with  a  sub-threshold 
increase  in  activation  of  the  predicted  response.  In  the  most  extreme  case, 
the  precue  provides  complete  information  about  the  response  to  be_given,  amd 
the  imperative  stimulus  only  provides  temporal  information.  For  example, 
Kutas  and  Donchin  (1980)  studied  the  lateralized  readiness  potential  in  the 
context  of  a  two-choice  reaction  time  task,  in  which  the  two  responses  were 
assigned  to  left  and  right  hands.  In  one  condition  the  precue  (warning 
stimulus)  perfectly  predicted  the  imperative  stimulus  that  would  occur  one 
second  later.  In  another  condition,  the  precue  did  not  provide  advance 
information.  In  the  precued  condition,  the  lateralized  readiness  potential 
developed  during  the  foreperiod  with  greater  negativity  at  scalp  sites 
contralateral  to  the  response  associated  with  the  predicted  stimulus.  In 
the  non-precued  condition,  on  the  average,  lateralization  was  not  evident 
until  after  the  response  relevant  information  was  delivered  by  the 
imperative  stimulus.  These  data  are  presented  in  Figure  2.  Rohrbaugh, 
Syndulko,  and  Lindsley  (1976)  and  Gaillard  (1978)  observed  a  similar 
lateralization  of  the  readiness  potential  during  the  foreperiod  of  simple 
warned  reaction  time  tasks. 
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Insert  Figure  2  About  Here 


Lateralization  effects  can  also  be  observed  when  the  precue  provides 
only  probabilistic  information  about  the  upcoming  imperative  stimulus  (and 
future  response).  In  fact,  when  the  validity  of  the  precue  is  .8, 
lateralization  effects  are  still  present  (Bosco  et  al.,  1986;  Gehring, 

Gratton,  Coles,  &  Donchin,  1986). 

Another  line  of  evidence  comes  from  studies  of  the  relationship  between 
lateralization  in  the  foreperiod  of  reaction  time  tasks  and  the  accuracy -*«d 
latency  of  responses.  In  particular,  the  probability  that  a  particular 
fast-guess  response  will  be  given  depends  on  the  degree  of  lateralization 
during  the  foreperiod  (Gehring  et  al.,  1986;  Gratton  et  al . ,  in  press). 
Additional  evidence  indicates  that,  in  a  simple  reaction  time  task,  fast 
responses  are  associated  with  greater  lateralization  than  slow  responses 
(Rohrbaugh  et  al.,  1976). 

Evidence  for  the  role  of  lateralized  brain  electrical  activity  in  the 
preparation  for  movement  is  also  provided  by  studies  of  the  effects  of 
externally  applied  direct  currents  on  reaction  time.  Birbaumer  and  his 
colleagues  (e.g.,  Jaeger,  Elbert,  Lutzenberger,  &  Birbaumer,  in  press) 
applied  currents  of  different  directions  at  C 3  and  C4.  This  procedure  is 
believed  to  result  in  lateralized  polarization  of  cortex.  These 
investigators  report  a  small,  but  significant  effect  on  reaction  time  as  a 
function  of  the  hand  used  to  respond  and  direction  of  current  flow. 

In  summary,  data  from  a  variety  of  sources  converge  in  suggesting  that 
the  lateralized  readiness  potential  can  be  used  as  a  measure  of  the  relative 
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activation  of  responses  (when  the  response  are  assigned  to  different  hands). 
The  potential  (a)  appears  to  be  generated  (at  least  in  part)  in  motor 
regions  of  the  brain,  (b)  has  a  fixed  level  at  the  moment  of  response 
emission,  (c)  is  influenced  by  those  variables  that  affect  response 
activation,  and  (d)  is  related  to  the  accuracy  and  latency  of  overt 
behavioral  responses. 

In  measuring  the  lateralized  readiness  potential  it  is  important  that 
steps  are  taken  to  ensure  that  only  movement-related  potentials  are 
represented  in  the  lateralization  waveforms.  First,  experimental  conditions 
are  established  so  that  parameters  of  left-  and  right-hand  responses  (such 
as  response  latency)  are  comparable.  Second,  averages  for  left-  and  — 
right-hand  responses  are  computed  separately.  Third,  these  averages  are 
then  combined  with  equal  weight.  If  these  steps  are  followed,  asyrmietri cal 
activity  unrelated  to  the  side  of  movement  should  average  to  zero  leaving 
only  movement-related  activity  (see  also  De  Jong,  Wierda,  Mulder,  &  Mulder, 
in  press;  Gratton  et  al.,  in  press). 

3.  Early  Communication  Paradigms 

Having  found  a  suitable  measure  of  partial  response  activation,  we  now 
need  to  consider  paradigms  in  which  we  can  use  the  measure  to  investigate 
the  question  of  early  communication  between  stimulus  and  response  systems. 
Recall  that  we  are  trying  to  determine  whether  preliminary  stimulus 
evaluation  can  lead  to  partial  response  activation.  Thus,  a  suitable 
paradigm  must  provide  the  opportunity  for  the  stimulus  evaluation  system  to 
transmit  preliminary  information  to  the  response  system  that  is  in  some  way 
distinguishable  from  that  derived  from  complete  evaluation.  By  "complete 
evaluation"  we  mean  evaluation  of  all  the  information  required  to  give  the 
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correct  response  -  In  the  present  paper  we  consider  two  paradigms  that 
appear  to  provide  this  opportunity,  the  conflict  paradigm  and  the  congruence 
paradigm. 

3. 1  The  Conf  1  i ct  Paradigm 

The  conflict  paradigm  is  based  on  the  idea  that  two  aspects  of  the 
stimulus  information  prime  different  responses.  Furthermore,  the 
manipulations  are  arranged  such  that  the  two  aspects  are  analyzed  (and  are 
therefore  available  to  the  response  system)  at  different  moments  in  time. 
Thus,  the  effects  of  preliminary  and  complete  evaluation  are  in  conflict. 

In  particular,  experimental  contingencies  in  a  two-choice  reaction  time  task 
are  arranged  such  that  preliminary  evaluation  should  lead  to  one  responsei-- 
while  complete  evaluation  should  lead  to  the  other  response.  Ii  the 
responses  are  assigned  to  different  hands,  brain  potentials  associated  with 
one  response  should  be  distinguishable  from  those  associated  with  the  other 
response.  Since  preliminary  evaluation  should  lead  to  the  activation  of  the 
incorrect  response,  while  complete  evaluation  should  lead  to  activation  of 
the  correct  response,  a  reversal  in  lateral ization  when  conflicting 
information  is  presented  provides  evidence  for  early  communication.  These 
ideas  are  represented  diagrammatical ly  in  Figure  3. 


Insert  Figure  3  About  Here 


We  describe  here  three  different  instances  of  the  conflict  paradigm. 
They  involve  different  degrees  of  separability  of  the  two  aspects  of  the 
information. 

In  the  first  instance  (Precueinq  Task,  e.g.,  Bosco  et  al.,  1986; 

Gehring  et  al.,  1986),  preliminary  information  is  delivered  by  one  stimulus, 
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while  complete  information  is  delivered  by  a  second  stimulus  presented 
sometime  later.  Thus,  information  1  and  2  (see  Figure  3)  are  delivered  at 
different  times. 

In  the  second  instance  ( No i  se-Compatibi lity  T ask ,  e.g.,  Gratton  et  al . , 
in  press;  Smid,  Mulder,  &  Mulder,  1988),  all  the  information  is  presented  in 
a  single  visual  array.  However,  the  conflicting  response-relevant 
dimensions  of  the  array  are  spatially  separated.  A  target  letter,  to  which 
the  subject  must  respond,  is  surrounded  by  noise  letters  that  call  for  the 
incorrect  response. 

In  the  third  instance  ( "Stroop11  Task,  Buckley,  Gratton,  Kramer,  Coles, 

&  Donchin,  1988),  the  competing  response-relevant  dimensions  are  provided-tey 
the  same  stimulus.  The  stimulus  is  a  word  whose  orthographic  attributes 
call  for  the  incorrect  response  and  whose  phonologic  characteristics  call 
for  the  correct  response. 

Thus,  the  three  examples  represent  different  degrees  of  separability  of 
the  two  response-relevant  stimulus  dimensions.  In  the  precueing  task,  they 
are  temporally  separated.  In  noise-compatibility  and  “Stroop"  tasks,  they 
are  different  attributes  of  the  same  stimulus.  In  the  noise-compatibility 
task,  the  attributes  are  also  spatially  distinguishable. 

3.1.1  The  Precueinq  Task.  In  this  task  (Bosco  et  al.,  1986;  Gehring 

et  al.,  1986)  one  of  two  warning  stimuli  informs  the  subject  that  the  same, 

or  the  other,  stimulus  will  appear  after  a  second  or  so  with  a  particular 

probability  (e.g.,  .8).  Subjects  must  execute  a  response  to  the  second 
stimulus  (i.e.,  imperative  stimulus)  with  one  hand  or  the  other.  In  this 
situation,  reaction  time  data  suggest  that  subjects  generally  use  the 
warning  information  to  prime  the  appropriate  response. 

As  can  be  seen  in  the  upper  panel  of  Figure  4,  the  lateralized 
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readinessjpotential  data  suggest  that  subjects  prime  the  appropriate 
response  during  the  foreperiod,  when  the  precue  provides  useful  information. 
As  the  lower  panel  of  Figure  4  indicates,  when  the  precue  has  no  predictive 
value,  there  is,  on  the  average,  no  lateralization  during  the  foreperiod. 

The  lateralization  only  develops  after  the  imperative  stimulus. 1 
In  the  predictive  condition  (upper  panel),  if  the  imperative  stimulus 


Insert  Figure  4  About  Here 


matches  the  warning  stimulus,  the  degree  of  lateralization  increases  in  the 
direction  of  the  predicted  (same)  response.  However,  if  an  unpredicted 
imperative  stimulus  is  presented,  as  happens  on  20%  of  the  trials,  then  the 
lateralization  reverses  in  the  direction  of  the  (different)  response.  These 
data  suggest  that,  when  conflicting  information  is  delivered  at  different 
points  in  time,  subjects  begin  by  priming  one  response  and  then  reverse 
their  priming  and  execute  the  other  response. 

The  next  two  experiments  address  a  more  challenging  question:  If 
information  calling  for  conflicting  responses  is  carried  by  the  same 
stimulus,  will  we  see  the  same  pattern  of  reversal  in  lateralization?  That 
is,  will  the  changes  in  lateralized  readiness  potential  evident  in  Figure  4 
be  compressed  into  the  post-stimulus  epoch?  If  the  answer  to  this  question 
is  "yes,"  then  we  will  have  evidence  that  preliminary  phases  of  evaluation 
can  lead  to  the  activation  of  the  response  system. 

3.1.2  The  Noise-Compatibility  Task.  In  this  task  (Gratton  et  al ,  in 
press;  Smid  et  al.,  1988),  an  array  of  letters  is  presented  on  a  screen,  and 
subjects  are  instructed  to  respond  with  their  left  or  right  hands  depending 
on  the  "target"  letter  appearing  on  the  center  of  the  array.  Letters  that 
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flank  the,  target  letter  (noise  letters)  can  be  the  same  as  the  target 
(compatible-noise  trials),  or  those  that  call  for  the  other  response 
(incompatible-noise  trials). 

Coles  et  al .  (1985)  and  Gratton  et  al.  (in  press)  have  shown  that 
information  about  the  letters  in  the  array  is  available  during  preliminary 
phases  of  stimulus  evaluation,  but  that  subjects  cannot  locate  the  target 
letter  until  later  (cf.  Treisman  &  Gelade,  1980).  Therefore,  we  should  see 
the  reversal  in  the  laterali2ed  brain  potential  when  the  target  letter  and 
noise  letters  conflict  -  if  early  communication  occurs. 

Data  from  the  Gratton  et  al.  (in  press)  study  are  presented  in  Figure 
5.  In  the  top  panel,  the  overt  behavioral  data  are  presented  in  the  form-of 
speed-accuracy  trade-off  function  that  Luce  (1986)  calls  "conditional 
accuracy  functions."  The  function  for  incompatible  noise  stimuli  reveals 
that  there  is  a  response  latency  (between  150  and  250  ms)  for  which  accuracy 
is  less  than  chance.  This  is  not  true  for  compatible  stimuli.  Accuracy  is 
above  chance  for  responses  to  compatible  noise  trials  with  a  latency  between 
150  and  250  ms.  This  indicates  that,  when  subjects  respond  at  this  latency, 
the  accuracy  of  their  responses  is  more  dependent  on  the  type  of  noise 
letters  than  on  the  target  letter.  It  appears  that,  at  this  latency,  the 
subject  has  identified  what  the  letters  are,  but  does  not  know  where  they 
are. 


Insert  Figure  5  About  Here 


For  responses  of  longer  latency,  response  accuracy  is  high  regardless 
of  the  compatibility  of  the  noise.  This  suggests  that  the  accuracy  of  these 
responses  is  the  result  of  correct  identification  and  localization  of  the 
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target  letter. 

Thus,  the  conditional  accuracy  functions  support  the  idea  that 
preliminary  stimulus  evaluation  results  in  identification  of  the  letters  in 
the  array,  and  that  later,  "complete'*  evaluation  results  in  the  location  of 
the  target  letter. 

Lateral ized  readiness  potential  data  are  shown  in  the  middle  panel  of 
Figure  5.  Note  that  these  functions,  like  the  conditional  accuracy 
functions  suggest  that  there  is  a  time  following  stimulus  presentation  when 
the  incorrect  response  is  preferentially  primed  if  the  array  contains 
incompatible  noise. 

Although  suggestive,  these  data  do  not  necessarily  support  the  idea _q£ 
early  communication.  In  fact,  both  lateral ization  waveforms  and  conditional 
accuracy  functions  are  the  result  of  aggregating  data  over  trials  -  and,  for 
this  reason,  they  may  not  present  an  accurate  picture  of  what  is  going  on 
during  individual  trials.  The  lateralization  data  are  based  on  averages  of 
all  trials  regardless  of  the  accuracy  of  the  response.  We  know  that  the 
readiness  potential  lateralizes  just  prior  to  the  execution  of  an  overt 
response.  For  incorrect  responses,  the  lateralization  would  be  associated 
with  a  downward  deflection  of  the  trace.  Thus,  the  reversal  seen  in  the 
average  lateralization  waveform  for  incompatible  trials  may  be  due  to  the 
fact  that,  at  intermediate  latencies,  the  average  lateralization  values 
reflect  the  contribution  of  the  larger  number  of  trials  with  Incorrect 
responses  at  these  latencies.  The  evidence  we  need  to  demonstrate  that 
early  communication  can  occur  is  a  reversal  in  the  lateralization  on 
individual  trials.  Obviously,  this  is  a  problem  because  one  must  aggregate 
data  in  order  to  enhance  the  signal/noise  ratio  of  the  lateral ized  readiness 
potential. 
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We  reasoned  that  a  solution  to  this  problem  was  to  select  a  subset  of 

trials  for  which  only  the  correct  overt  response  was  given  and  for  which 

overt  response  latency  was  relatively  constant  (300-349  msec).  We  have 
evidence  that  responses  of  this  latency  are  not  likely  to  be  the  result  of 
fast  guesses  (Gratton,  et  al . ,  in  press)  and  that  the  subjects  were 
responding  mainly  on  the  basis  of  a  complete  evaluation  of  the  stimulus. 

Thus,  any  reversal  in  the  lateralized  readiness  potential  seen  for  this 
subset  of  trials  cannot  be  attributed  to  the  inclusion  of  incorrect  response 

trials.  Rather,  it  should  be  attributed  to  partial  incorrect  response 

activation  as  a  result  of  early  communication  on  these  trials. 

Averages  for  these  trials  are  shown  in  the  lower  panel  of  Figure  5. 

Note  that,  on  incompatible  response  trials  the  reversal  in  lateralization  is 
clearly  visible  and  significant. 2  These  data  support  the  inference  that 
early  communication  is  possible  and  that,  at  least  on  some  trials  in  this 
task,  preliminary  information  was  transmitted  to  the  response  system  before 
the  stimulus  was  completely  evaluated. 

Similar  data  have  been  obtained  by  Smid  et  al.  (1988).  Their  data  are 
shown  in  Figure  6.  More  conditions  were  included  in  this  experiment; 
however,  the  reversal  in  lateralization  is  clearly  evident  in  the  data  for 
incompatible  noise.  These  averages  are  based  on  correct  response  trials 
only  -  but  the  latency  of  the  overt  response  was  not  restricted. 


Insert  Figure  6  About  Here 


3.1.3  The  “Stroop"  Task.  In  this  task,  conflicting  information  is 
carried  by  different  attributes  of  the  same  stimulus.  In  the  classic  Stroop 
paradigm,  names  of  colors  are  written  in  different  colored  inks.  When  the 
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color  name  and  ink  color  are  different,  and  the  subjects  must  name  the  ink 
color,  conflict  is  evident  in  reaction  time  and  accuracy  measures. 

Buckley  et  al .  (1988)  studied  the  lateralized  readiness  potential  in  a 
variant  of  the  Stroop  task.  In  this  experiment,  information  derived  from  an 
orthographic  analysis  of  a  visually  presented  word  conflicted  with 
information  derived  from  a  phonological  analysis  (cf.  Polich,  McCarthy, 

Wang,  &  Donchin,  1983;  Kramer  &  Donchin,  1987).  Two  words,  separated  by 
1000  ms,  were  presented  to  the  subject.  Four  types  of  word  pairs  were  used, 
in  which: 

a.  Words  rhymed  and  looked  alike  (e.g.  COOK  -  BOOK); 

b.  Words  rhymed  and  did  not  look  alike  (e.g.  DOUGH  -  FLOW);  — 

c.  Words  did  not  rhyme  but  looked  alike  (e.g.  DOVE  -  MOVE); 

d.  Words  neither  rhymed  nor  looked  alike  (e.g.  TABLE  -  SOAP). 

In  one  session,  the  subjects'  task  was  to  judge  whether  the  words 
rhymed,  in  another,  subjects  had  to  judge  whether  the  words  looked  alike. 
Note  that  conflict  is  present  in  two  types  of  word  pairs:  DOUGH  -  FLOW  and 
DOVE  -  MOVE.  For  half  the  subjects,  "yes"  judgements  were  indicated  with 
the  right  hand,  and  "no"  judgements  with  the  left.  The  converse  was  the 
case  for  the  other  subjects. 

Reaction  time  data  revealed  that  the  effects  of  conflict  were  much  more 
potent  in  the  rhyme  task  than  in  the  visual  task.  A  detailed  analysis  of 
conditional  accuracy  functions  suggested  that  information  about  the 
orthography  was  available  before  information  about  phonology. 

The  question  is,  therefore,  does  the  reversal  in  lateralization  occur 
for  conflict  words  in  the  rhyme  task?  No  reversal  would  be  expected  in  the 
visual  task  because  the  correct  response  can  be  made  on  the  basis  of  the 
orthographic  information,  which  is  available  first.  The  relevant  data  are 
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shown  in  Ihe  upper  panel  of  Figure  7.  In  the  conflict  conditions,  when  the 
words  look  alike  but  do  not  rhyme,  or  rhyme  but  do  not  look  alike,  there  is 
a  reversal  in  the  lateralized  brain  potential. 3  When  there  is  no  conflict, 
there  is  no  reversal.  The  lateralization  data  for  the  visual  task  are 
presented  in  the  lower  panel  of  Figure  7.  When  the  subject's  task  is  to 
judge  whether  the  words  look  alike  (the  visual  task),  there  is  no  reversal 
in  lateralization  when  phonology  conflicts  with  orthography. 


Insert  Figure  7  About  Here 


3.1.4  Summary.  The  data  from  the  different  instances  of  the  conflict 
paradigm  converge  in  suggesting  that  early  communication  between  the 
stimulus  evaluation  and  response  systems  does  indeed  occur.  Whether 
conflicting  information  is  presented  at  different  moments  in  time,  or  by 
different  attributes  of  the  same  physical  stimulus,  subjects  appear  to 
activate  the  incorrect  response  based  on  preliminary  evaluation.  The 
correct  response  may  be  activated  later,  when  more  information  about  the 
stimulus  is  processed. 

3.2  Congruence  Paradigm 

In  the  previous  section,  we  have  seen  how  the  lateralized  readiness 
potential  provides  evidence  for  early  communication  when  the  effects  of 
preliminary  evaluation  conflict  with  those  of  "complete"  evaluation.  As 
with  the  conflict  paradigm,  the  congruence  paradigm  focuses  on  the  idea  that 
a  particular  response  is  somehow  influenced  by  two  distinct  aspects  of 
stimulus  information.  However,  while  In  the  conflict  paradigm  the 
information  is  in  conflict,  in  the  congruence  paradigm  it  is  congruent.  The 
two  aspects  of  information  may  be  delivered  by  two  different  stimuli 
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presented. at  different  times,  or  by  the  same  stimulus.  In  the  latter  case, 
the  experimental  contingencies  are  such  that  one  aspect  of  the  information 
is  available  before  the  other.  The  congruence  paradigm  is  illustrated  in 
Figure  8. 


Insert  Figure  8  About  Here 


The  Bosco  et  al.  (1986)  and  the  Gehring  et  al .  (1986)  experiments, 
described  earlier,  provide  examples  of  a  situation  in  which  a  warning 
stimulus  delivered  information  that  was  congruent  with  a  subsequent 
•imperative  stimulus  80%  of  the  time.  As  can  be  seen  in  Figure  4,  under  — 
these  circumstances  lateralization  develops  in  the  foreperiod  in  the 
direction  of  the  expected  response,  and  when  a  congruent  imperative  stimulus 
is  presented,  the  lateralization  further  develops  in  the  same  direction. 

Can  we  see  a  similar  influence  of  partial  priming  when  the  two  aspects  of 
the  stimulus  information  are  presented  simultaneously?  In  the  Gratton  et 
al.  (in  press)  study,  this  situation  is  realized  when  target  and  noise 
letter  are  the  same  (the  compatible-noise  condition).  As  the  bottom  panel 
of  Figure  5  shows,  this  condition  is  associated  with  an  early  lateralization 
in  the  direction  of  the  correct  response,  which  is  quite  distinguishable 
from  the  later  lateralization,  and  which  is  symnetrical  to  the 
lateralization  in  the  direction  of  the  incorrect  response  in  the 
incompatible-noise  condition. 

In  the  preceding  examples,  the  number  of  alternative  responses  was 
always  two,  and  the  target  stimulus  could  vary  along  one  dimension. 
Furthermore,  the  partial  Information  (the  precue  or  the  noise  letters)  was 
not  always  valid.  In  contrast,  in  the  Miller  paradigm  (Miller,  1982),  used 
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by  Oe  Jong  et  al.  (in  press),  the  partial  information  is  always  valid,  but 
it  is  incomplete.  In  this  paradigm,  subjects  perform  a  four-choice  reaction 
time  task,  in  which  responses  are  assigned  to  the  index  and  middle  fingers 
of  each  hand.  Imperative  stimuli  vary  along  two  dimensions  (e.g.,  size  and 
letter  name),  and  each  response  is  assigned  to  a  unique  combination  of  the 
two  dimensions.  In  one  condition,  a  precue  (occurring  .5  sec  before  an 
imperative  stimulus)  provided  information  that  the  forthcoming  imperative 
stimulus  would  require  a  response  by  one  of  two  fingers  of  a  particular 
hand.  In  this  case,  a  1 ateral izat ion  developed  in  the  foreperiod  in  the 
direction  associated  with  the  cued  hand.  Following  the  imperative  stimulus 
which  specified  the  finger  to  be  used  in  the  response,  the  lateralization— 
continued  to  develop  in  the  same  direction.  Lateralization  in  the 
foreperiod  was  not  observed  when  the  precue  specified  the  finger  (on 
different  hands)  that  would  have  to  be  used. 

In  another  condition,  De  Jong  et  al .  (in  press)  sought  to  determine 
whether  early  lateralization  would  occur  when  the  partial  information  about 
hand  and  complete  information  about  finger  were  delivered  by  the  same 
stimulus.  To  insure  the  temporal  dissociation  between  the  two  aspects  of 
the  information,  the  discrimirabi 1 ity  of  the  two  stimulus  dimensions  was 
made  different.  It  was  assumed  that  information  concerning  the 
easy-to-discriminate  dimension  would  be  processed  faster  than  the 
information  concerning  the  difficult-to-discriminate  dimension.  Thus,  if 
early  communication  exists,  there  should  be  partial  response  priming  on  the 
basis  of  the  easy  dimension.  Three  basic  stimulus/response  assignments  were 
used.  In  the  first,  analysis  of  the  easy  dimension  should  permit  the 
subject  to  prime  the  response  hand  (PREPARE  HANDS),  in  the  second,  the  same 
finger  on  each  hand  (PREPARE  FINGERS),  and,  in  the  third,  a  different  finger 
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on  each  hand  (PREPARE  NEITHER). 

Oe  Jong  et  al.  (in  press)  predicted  that  partial  response  priming 
should  be  revealed  by  earlier  lateralization  of  the  readiness  potential  in 
the  PREPARE  HANDS  condition,  than  in  the  other  two  conditions.  This  is 
because  information  provided  by  the  easy-to-discriminate  aimension  should 
permit  the  subject  to  start  preparing  the  response  hand  before  selecting  the 
appropriate  finger  i_f  information  about  the  hand  was  available  early  in  the 
evaluation  process.  Note  that  early  communication  might  occur  in  the  other 
two  conditions,  but  it  would  not  be  detectable  since  it  would  result  in 
partial  priming  of  individual  fingers  on  both  hands. 

In  fact,  De  Jong  et  al.  (in  press)  found  no  evidence  for  earlier 
lateralization  in  the  PREPARE  HANDS  condition.  The  onset  of  the 
lateralization  did  not  differ  among  the  three  conditions.  These  data 
suggest,  therefore,  that  response  priming  on  the  basis  of  preliminary 
stimulus  evaluation  does  not  occur  in  this  case. 

3.2.1  Summary.  In  contrast  to  the  conflict  paradigm,  the  congruence 
paradigm  does  not  provide  consistent  support  for  the  idea  of  early 
conmunication.  When  the  congruent  information  is  delivered  at  different 
moments  in  time  by  two  stimuli,  the  readiness  potential  clearly  suggests 
that  the  subjects  prime  their  responses  on  the  basis  of  the  preliminary, 
partial  information.  When  congruent  information  is  delivered  by  the  same 
stimulus,  the  effect  of  early  conmunication  is  not  always  apparent.  It  is 
evident  when,  as  in  the  compatible  condition  of  the  noise-compatible  task, 
the  preliminary  information  is  sufficient  to  lead  to  the  preparation  of  one, 
and  only  one  response.  It  is  not  evident,  however,  in  the  Miller  task,  used 
by  De  Jong  et  al.  (in  press),  where  preliminary  information  is  compatible 
with  more  than  one  response.  In  the  PREPARE-HANDS  condition,  such 
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information  identifies  the  hand  to  be  used  ia  responding,  but  does  not 
specify  the  finger. 

This  difference  between  these  two  kinds  of  congruence  tasks  may  be 
associated  with  different  strategies.  Since  preliminary  evaluation  in  the 
Miller  task  is  insufficient  to  define  the  required  response,  subjects  may 
prime  that  response  to  a  lesser  degree  than  when  the  response  is  completely 
specified.  Thus,  in  the  Miller  task  the  lateralization  associated  with 
partial  evaluation  will  be  smaller,  and  therefore  less  detectable,  than  in 
the  noise-compatibility  task.  Two  other  possibil  ties  are  suggested  by  De 
Jong  et  al.  (in  press).  First,  subjects  may  not  use  the  results  of 
preliminary  evaluation  in  the  Miller  task  to  prime  their  responses.  — 

Instead,  they  may  wait  until  the  stimulus  has  been  completely  evaluated  and 
then  select  and  prepare  the  required  response  in  one  step,  rather  than 
selecting  the  response  hierarchically  (e.g.t  hand  first,  then  finger).  In 
this  case,  of  course,  no  effects  on  the  lateralized  readiness  potential 
would  be  expected.  Second,  while  subjects  may  select  their  responses 
hierarchically,  such  a  strategy  may  not  be  driven  by  the  relative  time  at 
which  information  about  the  two  dimensions  is  available,  but  by  the 
translation  rules  that  map  stimuli  to  responses.  In  terms  of  the  model  we 
presented  in  Figure  1,  these  strategic  effects  can  be  conceptualized  in 
terms  of  adjustments  in  the  way  in  which  stimulus  evaluation  and  response 
systems  communicate  (Logan,  1980). 

4.  Oiscussion 

The  data  reviewed  In  the  preceding  sections  suggest  that,  under  at 
least  some  conditions,  the  readiness  potential  measures  detect  the  presence 
of  early  communication.  Different  aspects  of  the  stimulus  Information 
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produce  distinguishable  effects  on  the  response  system.  In  the  conflict 
paradigm,  the  incorrect  response  can  be  activated  first  at  a  subthreshold 
level  as  a  result  of  preliminary  stimulus  evaluation.  Later  on,  the  correct 
response  is  activated  at  a  threshold  level  as  a  result  of  "complete" 
evaluation.  In  the  congruence  paradigm,  the  correct  response  can  be 
activated  first  at  a  subthreshold  level  by  partial  evaluation,  and  later  at 
a  threshold  level  by  complete  evaluation. 

Early  communication  is  not  predicted  by  models  that  postulate  a  single 
decision  at  the  interface  between  the  stimulus  evaluation  and  the  response 
systems  (Sternberg,  1969).  In  fact,  such  models  assume  that  information  is 
transmitted  between  different  elements  of  the  information  processing  system 
in  a  discrete,  all-or-none  fashion.  In  some  of  the  experiments  we  have 
described,  information  is  clearly  not  transmitted  in  a  single  chunk.  In 
this  sense,  the  data  are  compatible  with  continuous  flow  models  (Eriksen  & 
Schultz,  1979;  Grice  et  al.,  1982),  but  they  are  also  compatible  with  the 
asynchronous  discrete  model  of  Miller  (1982).  Although  the  form  of  the 
lateralized  readiness  potential  suggests  a  continuous  modulation  of  the 
level  of  response  activation,  this  continuity  maybe  an  artifact  of  the 
averaging  process.  A  small  number  of  discrete  "quanta"  of  activations  of 
the  correct  and  incorrect  response  may  produce  the  pattern  of  lateralization 
data  we  observe  in  the  conflict  and  congruence  paradigms.  In  fact,  at  the 
present  time,  we  have  no  way  to  distinguish  between  models  that  propose 
continuous  communication  and  those  that  propose  communication  in  multiple 
chunks. 

The  possibility  of  early  conmunication  means  that  the  response  system 
can  be  sensitive  to  intermediate  phases  of  the  stimulus  evaluation  process. 
Therefore,  measures  of  partial  response  activation  provided  by  the 
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lateralized  readiness  potential  can  be_used  to  investigate  how  stimulus 
information  is  analyzed.  For  example,  the  latency  of  the  lateralization  of 
the  readiness  potential  could  be  used  as  an  index  of  the  timing  of 
intermediate  evaluation  phases  (see  Buckley  et  al.,  1988;  De  Jong  et  al.,  in 
press;  Gratton  et  al.,  in  press).  When  lateralization  is  observed, 
following  stimulus  presentation,  the  subject  presumably  has  access  to  a 
particular  aspect  of  the  stimulus  information. 

Even  in  those  cases  in  which  the  lateralization  of  the  readiness 
potential  does  not  reveal  the  presence  of  early  comnun icat ion,  this  measure 
can  be  used  to  make  inferences  about  the  strategies  used  by  the  subject  to 
perform  the  task.  We  have  seen  an  example  of  this  in  the  De  Jong  et  al.  -(4* 
press)  study.  Note  that  to  use  the  lateralization  data  in  this  way  it  is 
essential  to  demonstrate  that  the  different  dimensions  of  stimulus 
information  manipulated  (a)  can  be  processed  independently,  and  (b)  activate 
responses  at  different  times.  Only  when  it  is  possible  to  demonstrate  that 
the  subject  might  have  performed  the  task  differently,  can  it  be  concluded 
that  strategic  choices,  rather  than  cognitive  limitations,  are  responsible 
for  the  failure  to  detect  early  communication. 

The  apparent  sensitivity  of  the  lateral ized  readiness  potential  to 
subthreshold  response  activation  suggests  a  number  of  other  situations  in 
which  this  measure  can  be  used  to  understand  the  Information  processing 
system.  For  example,  it  could  be  used  to  determine  whether  a  particular 
interference  effect  involves  subthreshold  activation  of  inappropriate 
responses,  or  is  instead  localized  at  more  central  levels  of  the  information 
processing  system.  Questions  about  the  locus  of  different  stimulus-response 
compatibility  effects  (e.g.,  Magliero,  Bashore,  Coles,  &  Donchin,  1984; 
Ragot,  1984)  may  be  addressed  in  this  way.  Furthermore,  the  onset  of  the 
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lateral i nation  of  the  readiness  potential  can  be  used  as  a  marker  for  the 
occurrence  of  a  particular  internal  event.  In  fact,  the  information 
processing  system  must  have  distinguished  between  two  alternative  hypotheses 
about  the  stimulus  by  the  time  the  lateralization  appears. 

In  general,  then,  the  lateralization  measure  can  be  used,  in 
conjunction  with  other  psychophysiolog ical  measures  (EMG  onset,  P300 
latency,  etc.)  and  measures  of  overt  behavior  (reaction  time,  accuracy, 
etc.)  to  explore  the  complexities  of  human  information  processing.  In  this 
paper,  we  have  described  one  way  in  which  the  lateralization  of  the 
readiness  potential  can  be  used,  namely  in  the  detection  of  "early 
conmunication. "  — 
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Footnotes- _ 

1.  When  the  warning  stimulus  predicted  the  imperative  stimulus  with  a 
probability  of  .8,  the  average  value  of  lateralization  for  the  last  300  ms 
of  the  foreperiod  (-0.68  microvolts)  was  significantly  less  than  0, 
t(5)=-3.38,  £<.05.  When  the  warning  stimulus  had  no  predictive  value,  the 
average  lateralization  (-0.18  microvolts)  was  not  significantly  different 
from  0,  t(5)=-1.30. 

2.  The  average  activity  in  the  interval  between  170  and  210  ms  after 
array  presentation  was  significantly  positive,  _t(5)=2.78,  £<.05. 

3.  The  average  activity  in  the  interval  between  310  and  350  ms  after 
the  presentation  of  the  second  word  was  significantly  more  negative  in  the 
no-conflict  than  in  the  conflict  condition,  -0.31  vs.  0.09  microvolts, 
t(9)=-2.78,  £<.05.  However,  the  average  activity  for  the  conflict  condition 
(the  "dip”)  visible  in  Figure  7  was  not  significantly  different  from  0, 
t(9)=0.71.  Note  that  in  this  task,  as  compared  to  the  noise-compatibility 
task,  there  was  considerably  more  variability  in  reaction  time.  This 
implies  that  separate  averages  should  be  computed  for  trials  with  different 
response  latencies  (as  we  did  in  Gratton  et  al . ,  in  press).  However,  this 
was  not  possible,  since  only  a  limited  number  of  word  pairs  could  be 
obtained  for  the  "conflict"  condition  (a  total  of  110).  Therefore,  the 
power  of  this  experiment  was  considerably  less  than  that  of  the  other 
experiments  described  in  this  paper,  and  the  lack  of  a  significant  result  is 
certainly  not  surprising.  Note,  however,  the  consistency  of  the  waveforms 
obtained  for  the  "conflict"  and  "no  conflict"  conditions  with  those  obtained 
in  comparable  conditions  of  the  noise-compatibility  task. 
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Figure  Legends 

F iqure  L  Schematic  representation  of  stimulus  evaluation  and  response 
activation  systems  and  the  concept  of  "early  communication."  Each  system 
may  pass  through  intermediate  states  (levels  of  activation).  The  evaluation 
system  may  or  may  not  pass  information  to  the  response  activation  system 
before  stimulus  evaluation  is  completed. 

Figure  2.  Lateralized  readiness  potentials  time-locked  to  the  motor 
response,  or  to  the  respond  stimulus,  for  three  different  conditions. 
Voluntary:  the  subject  produced  self-paced  squeezes.  Warned:  the  subject 
responded  to  the  respond  stimulus  (RS)  in  a  choice  reaction  time  when  the— 
warning  stimulus  (WS)  provided  information  about  the  hand  to  be  used. 

Choice  Warned:  the  subject  responded  to  the  respond  stimulus  (RS)  in  a 
choice  reaction  time  task  when  the  warning  stimulus  (WS)  did  not  provide 
information  about  the  hand  to  be  used.  (Copyright  1980,  Elsevier  Science 
Publishers.  Adapted  with  permission  from  the  author  and  publisher  from 
Kutas  &  Donchin,  1980). 

Figure  3.  The  Conflict  Paradigm.  The  subject  receives  conflicting 
information  (information  1  and  information  2  are  associated  with  different 
responses).  Preliminary  evaluation  (based  on  information  1)  provides 
initial  evidence  for  the  incorrect  response.  Complete  evaluation  (based  on 
information  2)  provides  evidence  for  the  correct  response.  Correct  and 
incorrect  responses  are  assigned  to  different  hands.  Thus,  a  reversal  in 
lateralization  of  the  readiness  potential  will  reveal  the  presence  of  early 


communication. 
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F igucfi.  4.  SI  indicates  the  time  of  presentation  of  the  warning 
stimulus;  S2  indicates  when  the  imperative  stimulus  was  presented.  Upward 
deflections  indicate  greater  negativity  at  the  scalp  site  contralateral  to 
the  same  response  -  and  can  be  considered  to  reflect  activation  of  the 
"same"  response.  Panel  A  presents  the  lateralization  data  for  the  condition 
in  which  the  precue  predicted  the  imperative  stimulus  with  a  probability  of 
.8.  The  solid  line  refers  to  trials  in  which  the  predicted  imperative 
stimulus  (“same")  actually  occurred,  while  the  dashed  lines  refers  to  trials 
in  which  the  unpredicted  imperative  stimulus  ("different")  was  presented. 
Panel  B  presents  comparable  data  for  the  condition  in  which  the  precue  did 
not  hava  any  predictive  value.  (From  Bosco  et  al.,  1986.) 

F iqure  5.  To£  panel.  EMG  response  accuracy  is  plotted  as  a  function 
of  EMG  response  latency  for  compatible  and  incompatible  conditions.  Middle 
panel .  Lateralization  values  for  the  two  compatibility  conditions.  Upward 
deflections  are  given  by  greater  negativity  at  the  electrode  site 
contralateral  to  the  correct  response  and  thus  indicate  correct  response 
activation.  Oownward  deflections  indicate  incorrect  response  activation. 
Bottom  panel .  Lateralization  values  for  a  subset  of  trials  on  which  (a)  EMG 
latency  was  300-349  msec,  and  (b)  the  first  discernible  EMG  response  was 
with  the  correct  hand.  (Copyright  1988,  American  Psychological  Association. 
Reprinted  with  permission  from  the  author  and  publisher  from  Gratton  et  al., 
in  press). 
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F igure  6.  Lateral i zation  of  the  readiness  potential  for  different 
conditions  of  the  noise/compatibility  experiment.  In  the  No  Noise 
condition,  target  letters  were  presented  alone.  In  the  Neutral  Noise 
condition,  noise  letters  were  not  associated  with  any  experimentally  defined 
response.  For  Compatible  and  Incompatible  Noise  conditions,  noise  letters 
were  those  that  called  for  the  same  or  the  opposite  response  as  the  target. 
(Adapted  with  permission  of  the  author  from  Smid  et  al.,  in  preparation.) 

Figure  ]_.  Lateralization  data  averaged  over  "yes"  and  "no"  responses, 
when  conflict  was  or  was  not  present  for  the  rhyme  judgment  task.  Upward 
deflections  correspond  to  greater  negativity  at  the  scalp  site  contralateral 
to  the  correct  response.  Averages  are  based  on  all  correct  response  trials 
regardless  of  response  latency.  There  were  insufficient  data  to  segregate 
trials  as  a  function  of  reaction  time.  To£  Panel .  Data  from  the  Rhyme 
Task.  Bottom  Panel .  Data  from  the  Visual  Task. 

Figure  8.  The  Congruence  Paradigm.  The  subject  receives  congruent 
information  (information  1  and  information  2  are  associated  with  the  same 
response).  Preliminary  evaluation  (based  on  information  1)  provides  partial 
evidence  for  the  correct  response.  Complete  evaluation  (based  on 
information  2)  provides  complete  evidence  for  the  correct  response.  A  "two- 
step"  increase  in  lateralization  of  the  readiness  potential  will  reveal  the 
presence  of  early  communication. 
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1.  Introduction 

Ever  since  Berger  (1929)  demonstrated  that  it  is  possible  to 
record  the  electrical  activity  of  the  brain  by  placing  electrodes 
on  the  surface  of  the  scalp,  there  has  been  considerable  interest 
in  the  relationship  between  these  recordings  and  psychological 
processes.  While  Berger  and  his  followers  focussed  their 
attention  on  spontaneous  rhythmic  oscillations  in  voltage  -  that 
is,  on  the  electroencephalogram  or  EEG  (see  chapter  14)  -  more 

recent  research  has  concentrated  on  those  aspects  of  the 
electrical  potential  that  are  specifically  time-locked  to  events 
-  that  is,  on  event-related  potentials  or  ERPs.  The  ERPs  are 
regarded  as  manifestations  of  brain  activities  that  occur  in 
preparation  for  or  in  response  to  discrete  events,  be  they 
internal  or  external  to  the  subject.  Conceptually,  the  ERPs  are 
regarded  as  manifestations  of  specific  psychological  processes. 
Later  in  this  chapter,  we  shall  review  what  is  known  about  their 
underlying  sources  and  their  relationship  to  physiological 
function  (Section  2) .  However,  we  shall  focus  for  the  most  part 
on  the  relationship  between  the  potentials  and  psychological 
function  (Sections  3,  4,  and  5). 

l.l  Deriving  event-related  potentials 

The  procedures  used  to  derive  ERPs  begin  with  the  same 
amplifiers  and  filters  used  to  obtain  EEG  (see  Figure  1) . 
Electrodes  are  attached  to  the  scalp  at  various  locations  and 
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connected  to  amplifiers.  The  locations  are  usually  chosen 
according  to  the  International  10-20  system  (Jasper,  1958) ,  such 
that  between-laboratory  and  between-experiment  comparisons  are 
possible.  The  outputs  of  the  amplifiers  are  converted  to  numbers 
by  a  device  for  measuring  electrical  potentials,  an  analog- 
digital  converter.  The  potentials  are  sampled  at  a  frequency 
ranging  from  100  to  10000  Hz  (cycles  per  second)  and  may  be 
stored  for  subsequent  analysis. 

The  ERP  is  small  (a  few  microvolts)  in  comparison  to  the  EEG 
(about  50  microvolts) .  Thus,  the  analysis  generally  begins  with 
a  procedure  to  increase  the  discrimination  of  the  "signal"  (the 
ERP)  from  the  "noise"  (background  EEG) .  The  most  common 
procedure  involves  averaging  samples  of  the  EEG  that  are  time- 
locked  to  repeated  occurrences  of  a  particular  event.  The  number 
of  samples  used  in  the  average  will  depend  on  the  signal-to-noise 
ratio.  However,  in  all  cases  the  samples  are  selected  so  as  to 
bear  a  constant  temporal  relationship  to  an  event.  Since  all 
those  aspects  of  the  EEG  that  are  not  time-locked  to  the  event 
are  assumed  to  vary  randomly  from  sample  to  sample,  the  averaging 
procedure  should  result  in  a  reduction  of  these  potentials 
leaving  the  event-related  potentials  visible.1  The  resulting 

1  Note  that  this  assumption  may  not  always  be  valid,  as,  for 
example,  in  the  case  of  variability  in  the  latency  and  other 
characteristics  of  the  ERP  from  sample  to  sample  (see  3.2.2). 
Furthermore,  the  ERP  derived  by  averaging  may  include  potentials 
that  do  not  originate  in  the  brain  but  are  time-locked  to  the 
event  (see  3.2.1) . 
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voltage  x  time  function  (see  Figure  2)  contains  a  number  of 
positive  and  negative  peaks  which  are  then  subjected  to  a  variety 
of  measurement  operations  (see  3.2).  These  peaks  are  generally 
described  in  terms  of  their  characteristic  polarity  and  latency. 
Thus,  P300  refers  to  a  positive  peak  with  a  modal  latency  of  300 
ms.  Other  descriptors  can  include  reference  to  the  psychological 
or  experimental  conditions  that  control  the  potential  (e.g. 
readiness  potential,  mismatch  negativity)  and  to  the  scalp 
location  at  which  the  potential  is  maximal  (e.g.  frontal  P300) . 
Note  that  each  peak  in  the  ERP  waveform  is  usually  associated 
with  a  particular  distribution  across  the  scalp.  Thus,  spatial 
(topographic)  distribution  is  regarded  as  an  important 
discriminative  characteristic  of  the  ERP  (Donchin,  1978;  Sutton  & 
Ruchkin,  1984).  The  relationship  between  spatial  distribution 
and  underlying  brain  activity  will  be  discussed  in  Section  2, 
while  Section  3  provides  a  more  thorough  description  of  the 
methods  used  to  extract  and  measure  ERPs.  Section  4  reviews  some 
of  the  more  commonly  measured  components. 

1.2  The  endogenous  versus  exogenous  distinction 

From  a  psychological  point  of  view,  it  is  convenient  to 
distinguish  between  different  types  of  ERPs.  First  we  can 
identify  those  ERPs  whose  characteristics  are  mostly  controlled 
by  the  physical  properties  of  an  external  eliciting  event.  Such 
potentials  are  considered  to  be  obligatory  and  are  referred  to  as 
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"sensory-"  or  "exogenous."  Second,  we  can  identify  ERPs  whose 
characteristics  are  determined  more  by  the  nature  of  the 
interaction  between  the  subject  and  the  event.  For  example,  some 
ERPs  vary  as  a  function  of  the  information  processing  activities 
required  of  the  subject;  others  can  be  elicited  in  the  absence  of 
an  external  eliciting  event.  These  potentials  are  referred  to  as 
"endogenous".  Naturally,  it  is  the  endogenous  potentials  that 
are  the  focus  of  those  researchers  interested  in  cognitive 
function.  (For  a  discussion  of  the  distinction  between  exogenous 
and  endogenous  potentials,  see  Donchin,  Ritter,  and  McCallum, 


1978)  . 
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2.  Physiological  Basis  of  ERPs 

In  this  section,  we  review  evidence  that  relates  the  scalp- 
recorded  electrical  activity  to  its  underlying  anatomical  and 
physiological  basis.  For  a  detailed  review  of  this  relationship, 
see  Nunez  (1981),  or  Allison,  Wood,  and  McCarthy  (1986). 


2.1  Proa  tha  brain  to  tha  scalp:  tha  ganaration  of  ERPs 

It  is  generally  assumed  that  ERPs  are  distant  manifestations 
of  the  activity  of  populations  of  neurons  within  the  brain.  This 
activity  can  be  recorded  on  the  surface  of  the  scalp  because  the 
tissue  that  lies  between  the  source  and  the  scalp  acts  as  a 
volume-conductor.  Since  the  electrical  activity  associated  with 
any  particular  neuron  is  small,  it  is  only  possible  to  record  at 
the  scalp  the  integrated  activity  of  a  large  number  of  neurons. 
Two  requirements  must  be  met  for  this  integration  to  occur:  (a) 
the  neurons  must  be  active  synchronously,  and  (b)  the  electric 
fields  generated  by  each  particular  neuron  must  be  oriented  in 
such  a  way  that  their  effects  at  the  scalp  cumulate.  As  a 
consequence,  only  a  subset  of  the  entire  brain  electrical 
activity  can  be  recorded  from  scalp  electrodes. 

Two  considerations  further  restrict  the  likely  sources  of 
the  ERP.  First,  since  the  ERP  represents  the  synchronous 
activity  of  a  large  number  of  neurons,  it  is  probably  not  due  to 
the  summation  of  pre-synaptic  potentials  (spikes) ,  because  these 
potentials  have  a  very  high  frequency  and  short  duration.  On  the 
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other  hand-,  post-synaptic  potentials,  having  a  relatively  slower 
time  course,  are  more  likely  to  be  synchronous,  and  therefore  to 
summate  to  produce  scalp  potentials.  Thus,  it  is  commonly 
believed  that  most  scalp  ERPs  are  the  summation  of  the 
post-synaptic  potentials  of  a  large  number  of  neurons  that  are 
activated  (or  inhibited)  synchronously  (see  Allison,  Wood,  & 
McCarthy,  1986)  . 

A  second  consideration  concerns  the  orientation  of  neuronal 
fields.  Since  the  electric  fields  associated  with  the  activity 
of  each  individual  neuron  involved  must  be  oriented  in  such  a  way 
as  to  cumulate  at  the  scalp,  only  neural  structures  with  a 
specific  spatial  organization  may  generate  scalp  ERPs.  Lorente 
de  No  (1947)  specified  the  spatial  organizations  that  are 
required  for  the  distant  recording  of  the  electrical  activity  of 
a  neural  structure.  He  distinguished  between  two  types,  "open 
fields"  and  "closed  fields"  (see  Figure  3). 

A  structure  having  an  open  field  organization  is 
characterized  by  neurons  that  are  ordered  so  that  their  dendritic 
trees  are  all  oriented  on  one  side  of  the  structures,  while  their 
axons  all  depart  from  the  other  side.  In  this  case,  the  electric 
fields  generated  by  the  activity  of  each  neuron  will  all  be 
oriented  in  the  same  direction  and  summate.  Only  structures  with 
some  degree  of  open  field  organization  generate  potentials  that 
can  be  recorded  at  the  scalp.  Open  fields  are  obtained  whenever 
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neurons  are  organized  in  layers,  as  in  most  of  the  cortex,  parts 
of  the  thalamus,  the  cerebellum,  and  other  structures. 

A  structure  with  a  closed  field  organization  is 
characterized  by  neurons  that  are  concentrically  or  randomly 
organized.  In  both  cases,  the  electric  fields  generated  by  each 
neuron  will  be  oriented  in  very  different,  sometimes  opposite, 
directions,  and  therefore  will  cancel  each  other.  Examples  of 
closed  field  organization  are  given  by  some  midbrain  nuclei. 

From  this  analysis  it  is  clear  that  ERPs  represent  just  a 
sample  of  the  brain  electrical  activity  associated  with  a  certain 
event.  Thus,  it  is  entirely  possible  that  a  sizeable  portion  of 
the  information  processing  transactions  that  occur  after  (or 
before)  the  anchor  event  are  silent  as  far  as  ERPs  are  concerned. 
For  this  reason,  some  caution  should  be  used  in  the 
interpretation  of  ERP  data.  For  instance,  if  an  experimental 
manipulation  has  no  effect  on  the  ERP,  we  cannot  conclude  that  it 
does  not  influence  brain  processes. 

2.2  Fro*  the  scalp  to  the  brain:  inferring  the  sources  of  ERPs 
So  far  we  have  examined  how  particular  properties  of 
neuronal  phenomena  may  determine  whether  they  will  be  recorded  at 
the  scalp.  We  have  approached  the  problem  of  ERP  generation  in  a 
direct  fashion,  from  properties  of  the  generators  to  predictable 
scalp  observations.  In  most  cases,  however,  we  have  only  limited 
information  about  the  neural  structure (s)  responsible  for  a 


T 


~YT~ 


Event-Related  Brain  Potentials 

Page  8 

speci f ic - aspect  of  the  ERP.  Our  database  consists  of 
observations  of  voltage  differences  between  scalp  electrodes  or 
between  scalp  electrodes  and  a  reference  electrode.  To  determine 
which  neural  structures  are  responsible  for  the  scalp  potential  - 
that  is,  to  identify  the  neural  generators  of  ERPs  -  we  must 
solve  the  inverse  problem.  We  have  to  infer  the  unique 
combination  of  neural  generators  whose  activity  results  in  the 
potential  observed  at  the  scalp. 

In  solving  this  problem,  we  are  confronted  with  an 
indefinite  number  of  unknown  parameters.  In  fact,  an  indefinite 
number  of  neural  generators  may  be  active  simultaneously,  and 
each  of  them  may  vary  in  amplitude,  orientation  of  the  electric 
field,  and  location  inside  the  head.  Since  a  limited  number  of 
observations  (the  voltage  values  recorded  at  different  scalp 
electrodes)  is  used  to  estimate  an  indefinite  number  of 
parameters,  it  is  clear  that  the  inverse  problem  does  not  have  a 
unique  solution.  A  further  complication  is  that  the  head  is  not 
a  homogeneous  medium.  Therefore  the  electric  field  generated  by 
the  activity  of  particular  structures  is  difficult  to  compute.  A 
particularly  important  distortion  of  the  electric  fields  is 
caused  by  the  skull  -  a  very  low  conductance  medium  that  reduces 
and  smears  electric  fields.  For  all  these  reasons,  we  cannot 
determine  unequivocally  which  structures  are  responsible  for  the 
ERP  observed  at  any  point  in  time,  when  the  only  information 
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available  is  given  by  the  potentials  recorded  at  scalp 
electrodes . 

In  spite  of  these  problems,  investigators  have  tried  to 
identify  the  neural  sources  of  the  scalp  ERP  using  a  variety  of 
approaches,  involving  both  non-invasive  and  invasive  techniques. 
Non-invasive  techniques  are  based  on  scalp  recordings.  They 
involve  complex  mathematical  procedures  and  depend  on  a  number  of 
restrictive  assumptions.  Invasive  techniques  are  based  on 
recordings  from  indwelling  electrodes  (in  humans  or  in  animals) 
or  on  lesion  data. 

A  non-invasive  approach  to  the  inverse  problem  is  to 
generate  several  alternative  hypotheses  about  neural  structures 
that  may  be  active  at  a  particular  point  in  time,  and  that  may  be 
responsible  for  an  observed  scalp  ERP.  The  distribution  of 
potentials  across  the  scalp  that  would  be  generated  by  each  of 
these  structures  can  then  be  computed  using  a  direct  approach. 
Finally,  the  structure  whose  activity  best  accounts  for  the 
observed  scalp  distribution  can  be  identified.  This  approach 
requires  that  specific  neurophysiological  knowledge  exists  about 
candidate  underlying  structures.  First,  we  must  have  some  reason 
to  restrict  the  number  of  candidate  neural  generators  to  a 
manageable  number.  Second,  we  must  have  sufficient  knowledge  of 
the  anatomy  and  physiology  of  each  of  these  candidate  neural 
generators  to  be  able  to  compute  the  distributions  of  the  scalp 
potentials  associated  with  their  activity.  At  present,  such 
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knowledge  exists  only  in  a  very  limited  number  of  cases  (see,  for 
example,  Scherg  &  von  Cramon,  1985) . 

A  more  sophisticated  and  promising  non-invasive  procedure 
involves  the  use  of  magnetic  field  recordings.  Magnetic  fields 
generated  by  brain  activity  are  extremely  small  in  relation  to 
magnetic  fields  generated  by  environmental  and  other  bodily 
sources.  Therefore,  their  measurement  is  both  difficult  and 
expensive.  The  advantage  of  measuring  the  magnetic  field  is  that 
it  is  practically  insensitive  to  variations  of  the  conductive 
media  (such  as  those  due  to  the  presence  of  the  skull) .  It  is 
therefore  much  easier  to  compute  the  source  of  a  particular 
field.  An  in-depth  discussion  of  the  problems  and  peculiarities 
of  the  magnetic  technique  is  beyond  the  scope  of  this  chapter  and 
can  be  found  elsewhere  (Beatty,  Barth,  Richer,  &  Johnson,  1986) . 
We  will  only  note  here  that  using  the  magnetic  technique  to 
determine  the  source  of  neural  components  still  requires 
assumptions  about  the  number  of  neural  structures  active  at  a 
particular  moment  in  time. 

Invasive  techniques  for  the  identification  of  the  sources  of 
ERP  components  are  based  on  the  implantation  of  electrodes  within 
the  brain  of  humans  or  animals.  Research  on  humans  has  been  made 
possible  by  the  need  for  recording  EEG  activity  in  deep  regions 
of  the  brain  for  diagnostic  purposes  (Halgren  et  al.,  1980;  Wood 
et  al.,  1984).  A  problem  with  the  human  research  is  that  the 
indwelling  electrodes  are  located  according  to  clinical,  rather 
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than  scientific  criteria,  and  therefore  may  fail  to  map  the 
regions  involved  in  the  generation  of  scalp  ERPs.  This  may  be 
solved  by  research  on  animals  (Buchwald  &  Squires,  1983;  Csepe, 
Karmos,  &  Molnar,  1987;  Starr  4  Farley,  1983).  However,  animal 
research  is  problematic  because  it  is  difficult  to  determine 
whether  the  ERP  observed  in  animals  corresponds  to  that  observed 
in  humans.  This  is  because  of  fundamental  differences  in  the 
anatomy  of  animal  and  human  brains.  Finally,  a  general  problem 
with  depth  recording  is  that  it  is  difficult  to  know  the  extent 
to  which  the  scalp  recorded  ERP  is  due  to  the  activity  of  the 
structures  that  have  been  identified  by  the  indwelling 
electrodes.  This  problem  can  be  resolved  in  animal  research  if 
lesions  in  the  structure  identified  as  the  candidate  generator 
result  in  elimination  of  the  scalp  potential. 

In  summary,  although  several  techniques  have  been  used  for 
identifying  the  source  of  ERP  components,  none  of  them  appears  to 
be  likely  to  give  definitive  answers  in  all  cases.  However,  the 
convergence  of  several  techniques  may  provide  useful  information 
about  the  neural  structures  whose  activity  is  manifested  at  the 
scalp  by  the  ERP. 
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3.  The  Inferential  Context 

In  this  section,  we  review  the  process  through  which  we  come 
to  make  inferences  about  psychological  processes  and  states  from 
the  measurement  of  ERPs.  To  begin  with,  however,  we  need  to 
address  a  number  of  assumptions  about  the  "meaning"  of  the  ERP 
and  a  variety  of  measurement  issues. 

3.1  The  concapt  of  conponants 

As  we  noted  above,  the  ERP  can  be  described  as  a  voltage  x 
time  function.  We  assume  that  the  various  voltage  fluctuations 
represented  by  this  function  reflect  the  activities  of  neuronal 
populations  and  that,  in  turn,  these  neuronal  populations  are 
responsible  for  the  execution  of  some  psychological  process.  In 
practice,  the  tendency  in  cognitive  psychophysiology  has  been  to 
focus  on  processes  identified  by  cognitive  psychologists  as 
candidate  psychological  processes. 

The  total  ERP  is  assumed  to  be  a  manifestation  of  the 
aggregate  of  a  number  of  ERP  "components".  The  components  can  be 
defined  in  three  different  ways  (Fabiani,  Gratton,  Karis,  & 
Donchin,  in  press;  Naatanen  &  Picton,  1987) .  First,  components 
can  be  defined  in  terms  of  the  peaks  and  troughs  (maxima  and 
minima)  that  are  observed  in  the  ERP  trace.  Second,  components 
can  be  defined  as  aspects  of  the  ERP  waveforms  that  are 
functionally  associated,  that  is,  they  covary  across  subjects  or 
conditions  or  location  on  the  scalp.  Third,  components  can  be 
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defined  -in  terms  of  those  neural  structures  that  generate  them. 
These  definitions  may  converge  in  some  circumstances.  However, 
as  Naatanen  and  Picton  (1987)  have  indicated,  a  peak  in  the  ERP 
waveform  (for  example,  the  Nl)  may  represent  the  summation  of 
several  functionally  and  structurally  distinct  components.  Thus, 
the  adoption  of  one  or  another  of  these  definitions  will  have 
important  consequences  for  the  interpretation  of  the  component 
structure  of  the  ERP  waveform.  A  corollary  of  this  is  that 
different  measurement  procedures  will  be  required  depending  on 
the  type  of  component  definition  that  is  adopted.  These 
procedures  will  be  reviewed  in  subsequent  sections  after  a  brief 
discussion  of  general  measurement  issues. 

3.2  General  measurement  Issues 

3.2.1  Artifacts.  The  potential  recorded  at  the  scalp  can 
be  influenced  by  sources  of  electrical  activity  that  do  not  arise 
from  the  brain.  Examples  of  these  sources  of  artifacts  include 
the  movement  of  eyeballs  and  eyelids,  tension  of  the  muscles  in 
the  head  and  neck,  and  the  electrical  activity  generated  by  the 
heart.  These  artifacts  can  be  dealt  with  in  the  following  ways. 
First,  one  can  set  up  the  recording  situation  so  that  artifacts 
are  minimized.  This  can  be  accomplished  by  suitable  choice  of 
electrode  locations  and  of  the  subject's  task  and  environment. 
Second,  one  can  simply  discard  records  which  contain  artifacts. 
Unfortunately,  this  procedure  may  lead  to  a  bias  in  the  selection 
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of  the  observations  and/or  subjects.  Third,  one  can  use  filters 
(see  below)  to  attenuate  artifactual  activity.  This  procedure  is 
useful  when  the  frequency  of  the  artifactual  activity  is  outside 
the  frequency  range  of  the  ERP  signal  of  interest.  For  example, 
the  frequency  of  electromyographic  activity  is  higher  than  that 
of  most  endogenous  ERP  components.  Fourth,  one  can  attempt  to 
measure  the  extent  of  the  artifact  and  then  remove  it  from  the 
data.  This  procedure  has  been  used  most  frequently  in  the  case 
of  ocular  artifacts  (e.g.  Gratton,  Coles,  &  Donchin,  1983) . 

3.2.2  signal-to-noise  ratio.  The  ERP  consists  of  a  series 
of  fluctuations  in  voltage  that  are  time-locked  to  an  event. 

These  voltage  changes  are  typically  small  (a  few  microvolts)  in 
relation  to  the  background  EEG  (about  50  microvolts)  in  which 
they  are  imbedded.  Thus,  a  major  measurement  problem  concerns 
the  extraction  of  the  ERP  signal  from  the  background  noise. 

Several  procedures  have  been  advocated  to  increase  the 
signal-to-noise  ratio,  including  filtering,  averaging,  and 
pattern  recognition  (see  Coles,  Gratton,  Kramer,  &  Miller,  1986, 
for  a  more  detailed  discussion) . 

Filtering  involves  the  attenuation  of  noise,  whose  frequency 
is  different  from  that  of  the  signal.  For  example,  most 
endogenous  components  have  frequencies  of  between  0.5  and  20  Hz. 
Thus,  at  the  time  of  recording  or  later  at  the  time  of  analysis, 
analog  or  digital  filters  can  be  used  to  attenuate  activity 
outside  this  frequency  range.  Great  care  should  be  taken  in  the 
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selection  -of  filters.  The  amplitude  and  latency  of  an  ERP 
component  (as  well  as  the  general  ERP  waveform)  will  be  distorted 
if  the  bandpass  of  the  filter  excludes  frequencies  of  interest 
(see  Figure  4). 

Averaging  involves  the  summation  of  a  series  of  EEG  epochs 
(or  trials) ,  each  of  which  is  time-locked  to  the  event  of 
interest.  These  EEG  epochs  are  assumed  to  be  given  by  two 
sources:  first,  the  ERP,  and  second,  other  voltage  fluctuations 
that  are  not  time-locked  to  the  event.  Since,  by  definition, 
these  other  fluctuations  are  random  with  respect  to  the  event, 
they  should  average  to  zero,  leaving  the  time-locked  ERP  both 
visible  and  measurable.  If  it  is  the  case  that  (a)  the  ERP 
"signals"  are  constant  over  trials,  (b)  the  noise  is  random 

across  trials,  and  (c)  that  the  ERP  "signals"  are  independent  of 

the  background  noise,  then  the  signal-to-noise  ratio  will  be 
increased  by  the  square  root  of  the  number  of  trials  included  in 

the  average.  Note  that  the  utility  of  the  averaging  procedure 

also  depends  on  the  fact  there  are  no  correlated  signals  (such  as 
the  EOG  -  see  3.2.1)  that  are  also  time-locked  to  the  event. 

One  of  the  problems  with  the  averaging  procedure  is  that  the 
three  assumptions  described  in  the  previous  paragraph  may  not 
always  be  satisfied  in  the  typical  experiment.  In  particular,  if 
the  latency  of  the  ERP  varies  from  trial  to  trial  (latency 
jitter),  the  average  ERP  waveform  will  not  be  representative  of 
the  actual  ERP  of  any  individual  trial.  A  related  problem  is 
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that  the-  investigator  may  be  interested  in  measures  of  the  ERP  on 
individual  trials.  Thus,  a  major  thrust  in  ERP  methodology  has 
been  to  derive  procedures  for  single-trial  analysis. 

Pattern  recognition  techniques  allow  the  investigator  to 
identify  segments  of  the  EEG  epoch  that  contain  specific 
features,  such  as  a  particular  pattern  of  pea)c  and  troughs 
characteristic  of  an  ERP  component.  The  advantage  of  these 
techniques  is  that  they  allow  the  investigator  to  identify  and 
measure  components  on  individual  trials.  Examples  of  pattern 
recognition  techniques  are  cross-correlation,  Woody  filter 
(Woody,  1967) ,  and  step-wise  discriminant  analysis  (Donchin  & 

Herning,  1975;  Horst  &  Donchin,  1980;  Squires  &  Donchin,  1976) . 

I 

In  the  case  of  cross-correlation  and  Woody  filter,  the  individual 
trial  epoch  is  scanned  to  determine  the  segment  that  best 
resembles  an  " ideal”  template  corresponding  to  the  component  of 
interest.  In  the  case  of  discriminant  function,  the  procedure 
begins  with  the  selection  of  two  sets  of  waveforms,  that  are 
presumed  to  differ  in  terms  of  a  specific  component.  Then, 
features  are  identified  (in  the  form  of  numerical  weights)  that 
best  discriminate  between  the  two  sets  of  waveforms.  These 
features  are  considered  to  represent  the  defining  characteristic 
of  the  component  and  individual  trials  can  be  examined  to 
determine  the  extent  to  which  the  features  are  present.  Note 
that  the  cross-correlation  and  Woody  filter  procedures  can  yield 
both  amplitude  and  latency  estimates  for  individual  trials,  while 
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discriminant  function  procedures  yield  only  amplitude  estimates. 
(For  a  general  discussion  of  pattern  recognition  techniques,  see 
Glaser  &  Ruchkin,  1976.) 

3.3  Component  quantification 

In  this  section,  we  describe  procedures  that  have  been  used 
to  quantify  ERP  components.  As  mentioned  earlier,  these 
measurement  operations  will  depend  on  the  way  in  which  ERP 
components  are  defined. 

3.3.1  Peak  measurement.  As  indicated  above,  components  can 
be  defined  in  terms  of  peaks  or  troughs  having  characteristic 
polarities  and  latency  ranges.  Thus,  the  measurement  operation 
involves  the  assessment  of  either  amplitude  of  the  peak  in 
microvolts,  or  the  assessment  of  its  latency  in  msec  (see  Figure 
5) .  Amplitude  is  usually  referred  either  to  the  baseline,  pre¬ 
event,  voltage  level  (base-to-peak  amplitude)  or  to  some  other 
peak  in  the  ERP  waveform  (peak-to-peak  amplitude) .  Latency  is 
referred  to  the  time  of  occurrence  of  the  event. 

When  the  component  under  analysis  does  not  have  a  definite 
peak,  it  is  customary  to  measure  the  integrated  activity  across  a 
particular  latency  range  (area  measure). 

3.3.2  Covariation  measures.  As  mentioned  above,  components 
can  also  be  defined  in  terms  of  segments  of  the  ERP  waveform  that 
exhibit  covariation  across  subjects,  conditions,  or  scalp 
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locations.  As  a  consequence,  procedures  are  needed  to  identify 
and  measure  these  segments. 

One  the  most  commonly  used  procedures  is  Principal  Component 
Analysis  (PCA) .  This  procedure  can  be  applied  to  describe  the 
structure  of  the  variance  associated  with  a  set  of  ERP  waveforms. 
This  structure  is  inferred  from  the  pattern  of  covariance  between 
the  voltage  values  for  each  timepoint  in  the  waveform.  The 
outcome  of  PCA  is  a  set  of  components  each  characterized  by  a 
particular  pattern  of  weights  (loadings,  or  eigenvectors)  for 
each  timepoint  (see  Figure  6) .  Then,  the  weights  for  each 
component  can  be  used  as  a  linear  filter  to  derive  measures  of 
amplitude  of  each  component  in  a  particular  waveform.  Each  point 
in  the  waveform  is  multiplied  by  the  weight  for  that  point.  The 
resulting  values  are  summed  to  yield  factor  scores  (or  measures 
of  component  amplitude) .  A  more  detailed  description  of  the 
application  of  PCA  to  ERPs  can  be  found  in  Donchin  and  Heffley 
(1978)  . 

3.3.3  Source  activity  measures.  A  third  way  of  defining 
components  is  in  terms  of  underlying  sources.  According  to  this 
definition,  we  should  quantify  the  activity  of  these  sources  to 
provide  latency  and  amplitude  measures  of  the  different 
components.  As  we  noted  earlier  (Section  2) ,  the  relationship 
between  scalp  electrical  activity  and  source  activity  is 
difficult,  if  not  impossible,  to  describe.  Thus,  for  the  time- 
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being,  this  type  of  component  measurement,  although  theoretically 
important,  is  practically  unfeasible. 

3.3.4  Problems  in  component  measurement.  In  this  section 
we  discuss  two  specific  problems  that  arise  during  component 
measurement. 

The  first  problem  concerns  the  commensurability  of  the 
measurements  of  different  waveforms.  Is  a  particular  component, 
recorded  under  a  particular  set  of  circumstances,  the  same  as 
that  recorded  in  another  situation?  This  is  a  particular  problem 
when  we  define  components  as  a  peak  observed  at  a  particular 
latency.  For  example,  if  the  latency  of  the  peak  differs  between 
two  experimental  conditions,  we  would  be  led  to  conclude  that 
different  components  are  present  in  the  two  sets  of  data.  How 
can  we  be  sure  that  the  same  component  varies  in  latency  between 
the  two  conditions,  rather  than  that  two  different  components  are 
present  in  the  two  different  conditions?  The  problem  is  not 
resolved  by  using  PCA.  In  this  case,  how  can  we  know  whether  two 
components  extracted  by  separate  PCAs  conducted  on  different  sets 
of  data  reflect  the  same  activity?  A  solution  to  this  problem 
can  only  be  derived  from  a  careful  examination  of  the  pattern  of 
results  obtained,  and  from  a  comparison  of  these  results  with 
what  we  already  know  about  different  ERP  components.  Of  course, 
this  means  that  we  are  including  a  large  number  of  empirical  and 
theoretical  arguments  in  the  definition  of  each  ERP  component, 
which,  of  course,  may  differ  from  one  component  to  another,  and 
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from  time  to  time.  Thus,  the  definition  of  a  component  may 
include  not  only  polarity  and  latency,  but  also  distribution 
across  the  scalp  and  sensitivity  to  experimental  manipulations 
(see,  for  example,  Fabiani  et  al.,  in  press).  Thus,  it  is  clear 
that  a  correct  interpretation  of  ERP  component  structure  requires 
some  background  information  about  the  components  themselves  (see 
section  4 ) . 

A  second  problem  in  component  measurement  is  that  of 
component  overlap.  Usually,  ERP  components  do  not  appear  in 
isolation,  but  several  of  them  may  be  active  at  the  same  moment 
in  time.  This  reflects  the  parallel  nature  of  brain  processes. 
When  this  occurs,  it  is  difficult  to  attribute  a  particular 
portion  of  scalp  activity  to  a  particular  component.  Peak  and 
area  measures  are  particularly  susceptible  to  this  problem,  but 
even  PCA  can,  in  some  cases,  misallocate  variance  across 
different  components  (see  Wood  &  McCarthy,  1984) .  As  a  result, 
we  may  attribute  a  difference  obtained  between  two  particular 
experimental  conditions  to  the  wrong  component. 

Several  procedures  have  been  proposed  to  solve  the  problem 
of  component  overlap,  but  none  of  them  seems  to  have  universal 
validity.  In  some  cases,  it  can  be  assumed  that  only  one 
component  varies  between  two  experimental  conditions.  In  this 
case,  the  variation  of  this  component  can  be  isolated  by 
subtracting  two  sets  of  waveforms.  Unfortunately,  very  rarely 
can  we  assume  that  the  effect  of  an  experimental  variable  is  so 


Event-Related  Brain  Potentials 

Page  21 

selective <-  Furthermore,  the  subtraction  procedure  implies  that 
only  amplitude,  and  not  latency,  varies  across  experimental 
conditions . 

A  procedure  that  may  help  solve  both  these  problems  is 
vector  filtering  (Gratton,  Coles,  &  Donchin,  submitted) .  This 
procedure  begins  with  the  assumption  that  components  can  be 
defined  in  terms  of  scalp  distribution.  Any  component  can  be 
represented  by  a  specific  profile  of  amplitude  values  at 
different  electrode  locations.  This  profile  is  then  used  as  a 
"distributional  filter"  to  determine  the  extent  to  which  the 
component  is  present  in  a  particular  epoch  (see  Figure  7) .  This 
procedure  can  be  applied  to  several  components  simultaneously, 
and,  thus,  it  is  possible  to  separate  the  contribution  of 
several,  overlapping,  components  to  an  observed  waveform.  Note 
that,  in  contrast  to  PCA,  neither  the  latency  nor  the  time-course 
of  the  component  (but  only  the  scalp  distribution)  need  be 
constant  over  trials.  The  assumption  of  constancy  of 
distribution  may  be  more  valid  than  assumptions  of  constancy  of 
latency  and  time-course. 

Note  that  distributional  filters  perform  the  same  Kind  of 
operations  in  the  spatial  domain  that  frequency  filters  perform 
in  the  frequency  domain.  While  frequency  filters  apply  different 
weights  to  activity  in  different  frequency  bands,  distributional 
filters  apply  different  weights  to  activity  from  different 
spatial  locations. 
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3.4  Experimental  Logic 

In  this  section,  we  review  the  experimental  procedures  that 
lead  to  a  specification  of  the  functional  significance  of  the 
components.  Some  theory  about  the  functional  significance  of  a 
component  is  essential  to  the  understanding  of  the  changes  that 
this  component  will  exhibit  as  a  function  of  specific  contexts. 
The  development  of  a  theory  about  the  functional  significance  of 
a  component  is  a  complex  process  that  involves  studies  of  the 
component's  antecedents  and  consequences,  as  well  as  speculations 
about  the  psychological  function  it  manifests. 

To  illustrate  this  process,  we  shall  focus  on  the  approach 
adopted  in  our  laboratory  (see  Coles,  Gratton,  &  Gehring,  1987; 
Donchin,  1981;  Donchin  &  Coles,  in  press;  Donchin,  Karis, 

Bashore,  Coles,  &  Gratton,  1986).  While  this  approach  is 
oriented  towards  cognitive  ERP  research,  and  to  the  P300 
component  in  particular,  there  is  no  reason,  in  principle,  why  it 
could  not  be  applied  to  other  approaches  to  ERPs  and  to  other 
components . 

3.4.1  Antecedent  conditions.  In  the  case  of  all  ERP 
components,  the  initial  phase  in  the  process  begins  with  the 
"discovery"  of  a  component.  For  most  endogenous  ERP  components, 
this  discovery  occurred  during  the  1960s  and  1970s;  for  the  P300, 
this  occurred  in  1965  (Sutton,  Braren,  Zubin,  4  John,  ’965). 

After  this  report,  there  followed  a  period  in  which  attempts  were 
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made  to  map  out  the  antecedent  conditions  that  influence  the 
amplitude  and  latency  of  the  component. 

For  amplitude,  a  large  series  of  studies  focussed  on  the 
"oddball"  paradigm  in  which  subjects  are  presented  with  two 
stimuli,  or  classes  of  stimuli,  that  occur  in  a  Bernouilli 
sequence.  The  probability  of  one  stimulus  is  generally  less  than 
that  for  the  other  and  the  subject's  task  is  to  count  the  rarer 
of  the  two  stimuli.  The  basic  conclusion  of  these  kinds  of 
studies  is  that  the  amplitude  of  the  P300  is  sensitive  to  the 
probability  of  task  relevant  events  (see,  for  example,  Duncan- 
Johnson  &  Donchin,  1977,  and  Figure  8;  see  also  Donchin  et  al., 
1986a) .  Further  research  indicated  that  it  is  subjective,  rather 
than  objective,  probability  that  controls  the  amplitude  of  P300 
(Squires,  Wickens,  Squires,  &  Donchin,  1976).  In  addition,  the 
stimuli  or  stimulus  classes  can  be  as  diverse  as  male  versus 
female  names  (Kutas,  McCarthy,  &  Donchin,  1977)  or  pictures  of 
politicians  versus  pictures  of  others  (Towle,  Heuer,  &  Donchin, 
1980)  ,  or  even  the  absence  of  the  stimulus  (Sutton,  Tueting, 
Zubin,  &  John,  1967) .  Furthermore,  the  scalp  distribution  is 
independent  of  the  modality  of  the  stimulus  (Simson,  Vaughan,  & 
Ritter,  1976) . 

A  second  factor  controlling  the  amplitude  of  P300  is  the 
task-relevance  of  the  eliciting  event.  Thus,  P300s  are  only 
elicited  if  the  subject  must  use  the  stimuli  to  perform  the 
assigned  task.  If  the  events  occur  while  the  subject  is 
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performing  another  task  (such  as  a  word  puzzle)  then  even  rare 
events  do  not  elicit  the  P300  (see  Figure  8:  Duncan-Johnson  & 
Donchin,  1977)  .  Furthermore,  the  P300  to  an  event  is  directly 
related  to  the  event's  utility  in  terms  of  the  subject's  task 
(Johnson  &  Donchin,  1978;  Bosco  et  al.,  1986). 

A  final  series  of  studies  (see  Donchin,  Kramer,  &  wickens, 
1986)  has  demonstrated  that  the  P300  is  related  to  the  processing 
resources  demanded  by  a  particular  task.  In  a  dual-task 
situation,  P300  amplitude  to  primary-task  events  increases  with 
the  perceptual/cognitive  resource  demands,  while  the  P300 
response  to  the  concurrent  secondary-task  decreases  (Sirevaag, 
Kramer,  Coles,  &  Donchin,  1984) . 

As  far  as  P300  latency  is  concerned,  the  research  has 
focussed  on  the  identification  of  those  processes  that  have 
elapsed  prior  to  the  elicitation  of  P300.  As  an  initial 
observation,  it  can  be  noted  that,  if  P300  amplitude  is  sensitive 
to  probability,  then,  at  the  very  least,  processes  required  to 
establish  the  rareness  of  an  event  must  occur  prior  to  the  P300 
process.  On  the  basis  of  this  observation,  Donchin  (1979) 
proposed  that  P300  latency  may  reflect  stimulus  evaluation  or 
categorization  time.  This  idea  was  supported  by  the  observation 
that  the  correlation  between  P300  latency  and  reaction  time  is 
higher  when  subjects  are  given  accuracy  rather  than  speed 
instructions.  Furthermore,  as  categorization  becomes  more 
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difficult,-  so  P300  latency  becomes  longer  (see  Figure  9:  Kutas, 
McCarthy,  &  Donchin,  1977) . 

If  P300  latency  reflects  stimulus  evaluation  time,  then  it 
should  not  be  affected  by  factors  that  influence  response-related 
processes.  Several  studies  (McCarthy  &  Donchin,  1981;  Magliero, 
Bashore,  Coles,  &  Donchin,  1984?  Ragot,  1984)  demonstrate  that 
manipulations  that  should  affect  the  duration  of  response-related 
processes  (i.e.  stimulus-response  compatibility)  have  little  or 
no  effect  on  P300  latency,  while  manipulations  of  stimulus 
complexity  have  a  large  effect.  (Both  these  manipulations  have  a 
substantial  effect  on  reaction  time.)  Thus,  it  appears  that  the 
P300  is  more  dependent  on  the  completion  of  processes  of  stimulus 
evaluation  and  categorization  than  on  those  related  to  the 
current  overt  response. 

3.4.2  Formation  of  theory.  Taken  together,  these  data 
suggest  that  the  P300  is  dependent  on  (a)  the  subjective 
probability  of  task  relevant  effects,  (b)  the  value  or  meaning  of 
the  event  in  the  context  of  the  task,  and  (c)  the  psychological 
resources  allocated  to  the  processing  of  the  event.  Furthermore, 
the  P300  is  not  emitted  until  the  event  has  been  categorized.  If 
responses  are  based  on  incomplete  stimulus  evaluation,  the  P300 
will  occur  after  the  overt  response.  (See  also  Johnson,  1986,  for 
a  discussion  of  the  determinants  of  P300,  and  Donchin  et  al., 
1984,  for  a  discussion  of  the  relationship  between  P300  and  the 
orienting  reflex.) 
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With  these  observations  in  mind,  one  can  proceed  to 
speculate  about  the  functional  significance  of  the  P300.  Thus, 
Donchin  (1981;  Donchin  &  Coles,  in  press)  argued  that  it  is  a 
manifestation  of  a  process  related  to  the  updating  of  models  of 
the  environment  or  context  in  working  memory.  Such  an  updating 
will  depend  on  the  processing  of  the  current  event  (hence,  the 
P300  latency/stimulus  evaluation  time  relationship)  but  will  have 
implications  for  the  processing  of  and  response  to  future 
events. 2 

3.4.3  Consequences.  These  kinds  of  statements  relating  to 
functional  significance  can  be  tested  by  an  examination  of  the 
predicted  consequences  of  variation  in  the  latency  or  amplitude 
of  the  P300  for  the  outcome  of  the  interaction  between  the 
subject  and  the  environment. 

For  example,  as  far  as  P3  00  latency  is  concerned,  if  the 
P300  occurs  after  the  stimulus  has  been  evaluated,  then  the 
quality  of  the  subject's  response  to  that  event  should  depend  on 
the  timing  of  that  response  relative  to  the  occurrence  of  the 
P300 .  Thus,  Coles,  Gratton,  Bashore,  Eriksen,  and  Donchin  (1985) 
showed  that,  for  a  given  response  latency,  response  accuracy  was 


2  Note  that  other  theories  of  P300  have  been  offered  by,  for 
example,  Desmedt  (1981),  Rosier  (1983),  and  Verleger  (in  press). 
Both  Desmedt  and  Verleger  propose  that  the  P300  is  related  to  the 
termination  or  "closure"  of  processing  periods,  while  Rosier 
proposed  that  P300  reflects  controlled  processing.  As  we  have 
argued  elsewhere  (Donchin  &  Coles,  in  press),  these  theories  do 
not  account  for  the  richness  of  the  results  concerning  the 
consequences  of  the  P300  that  have  accrued  in  the  last  decade. 
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higher  the  shorter  the  P300  latency.  This  is  illustrated  in 
Figure  10. 

For  P300  amplitude,  we  expect  that,  to  the  extent  that  the 
subject's  future  behavior  depends  on  the  degree  to  which  an  event 
leads  to  a  change  in  their  model  of  the  environment,  that 
behavior  will  be  related  to  P300  amplitude.  Thus,  there  have 
been  several  studies  that  have  demonstrated  a  relationship 
between  the  memorability  of  an  event,  assessed  at  some  future 
time,  and  the  amplitude  of  the  P300  response  to  the  event  at  the 
time  of  initial  presentation  (see  section  4.4.1  for  a  detailed 
discussion) . 

As  another  example,  we  have  demonstrated  that  the  subject's 
future  strategy  as  revealed  in  overt  behavior  can  be  predicted 
from  the  P300  response  to  current  events  (Donchin,  Gratton, 
Dupree,  &  Coles,  in  press) .  In  particular,  in  a  speeded  choice 
reaction  time  task,  the  amplitude  of  the  P300  following  an  error 
was  related  to  the  latency  and  accuracy  of  overt  responses  on 
subsequent  trials. 

These  kinds  of  experiments  that  investigate  the  predicted 
consequences  of  variation  in  the  P300  response  for  overt  behavior 
provide  tests  of  the  theories  of  the  functional  significance  of 
the  P300 .  As  a  result,  the  theories  are  refined  or  revised, 
until,  eventually,  we  are  confident  enough  to  proceed  to  use  the 
ERP  measure  to  monitor  information  processing  activities  (see, 
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for  exampTe,  Coles  et  al.,  1985;  Kramer,  Sirevaag,  &  Braune, 

1987)  . 


3.5  The  role  of  neurophysiology 

As  we  noted  earlier,  there  are  serious  problems  in  trying  to 
infer  the  source  of  a  given  ERP  component  from  its  distribution 
on  the  scalp.  Thus,  the  identification  of  a  particular  ERP 
component  with  a  particular  intra-cranial  source  must  depend  on  a 
variety  of  converging  methodologies  including  magnetic  and  intra¬ 
cranial  recordings  in  humans  and  animals,  neuropsychological 
research  and  lesion  studies  in  animals. 

What  are  the  consequences  of  knowing  the  intra-cranial 
source  for  theorizing  about  the  functional  significance  of  an  ERP 
component?  We  should  emphasize  that  by  "functional  significance” 
we  mean  a  specification  of  the  information  processing 
transactions  that  are  manifested  by  the  component,  not  its 
neurophysiological  significance. 

In  the  case  of  the  P300,  there  has  been  considerable 
speculation  that  at  least  one  source  lies  in  the  hippocampus 
(Okada,  Kaufman,  &  Williamson,  1983;  Halgren  et  al.,  1980).  If, 
in  fact,  this  speculation  is  supported  by  future  research,  we 
could  incorporate  what  is  known  about  the  psychologically 
relevant  functions  of  the  hippocampus  into  our  theory  of  the  P300 
(and  vice  versa) .  In  this  regard,  it  is  interesting  to  note  the 
similarity  between  theories  of  hippocampal  function  that 


Event-Related  Brain  Potentials 

Page  29 

emphasize  its  role  in  memory  (O'Keefe  &  Nadel,  1978)  and  the 
theory  of  P300  outlined  in  the  previous  sections  that  emphasizes 
its  relationship  to  "context  updating." 

Note  that  we  can  articulate  a  theory  of  the  functional 
significance  of  P300  without  any  Icnowledge  of  its  neural  origin. 
However,  when  its  neural  origin  is  Known,  then  the  theory  can  be 
refined  and  developed  on  the  basis  of  what  is  known  about  the 
underlying  neural  structures.  In  this  sense,  then, 
neurophysiological  knowledge  may  be  useful,  but  not  necessarily 
critical,  to  the  psychophysiological  enterprise.  Of  course, 
such  knowledge  is  critical  for  those  who  wish  to  use  ERPs  as  a 
sign  of  neurophysiological  function. 

3.6  Using  the  measure:  Psychophysiological  inference 

In  the  preceding  sections,  we  have  considered  the  ways  in 
which  ERP  researchers  establish  the  functional  significance  of 
the  ERP  components.  In  a  sense,  this  research  can  be  considered 
as  establishing  the  validity  of  ERP  components  as  measures  of 
particular  psychological  activities.  In  this  section,  we 
consider  the  ways  in  which  the  measures  are  used  to  make 
inferences  about  psychological  processes.  We  shall  review  a 
series  of  inferential  steps,  from  the  crudest  to  the  most 
sophisticated,  that  depend  to  a  greater  and  greater  extent  on 
assumptions  about  the  functional  significance  of  the  ERP.  For 
the  purposes  of  elucidating  the  inferential  process,  we  shall 
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consider-  an  experiment  in  which  subjects  are  run  in  two  different 
conditions . 

3.6.1  inference  1:  Conditions  ere  different.  At  the  most 
fundamental  level,  we  can  ask  whether  or  not  the  two  conditions 
are  associated  with  different  responses.  The  analytic  procedure 
necessary  to  answer  this  question  would  involve  a  univariate  or 
multivariate  analysis  of  variance  (with  condition  and  timepoint 
as  factors) .  Given  that  such  an  analysis  yields  a  significant 
condition  by  timepoint  interaction,  we  can  infer  that  the 
conditions  are  different.  If  we  assume  that  the  ERP  is  a  sign  of 
brain  activity  and/or  that  it  reflects  some  psychological 
process,  then  we  can  infer  that  the  brain  activity,  and 
associated  psychological  processing,  are  different  in  the  two 
conditions. 

3.6.2  Inference  2:  Condi tiona  differ  at  a  particular  time. 
The  second  level  of  inference  concerns  the  time  at  which  the  two 
conditions  differ.  This  inference  could  be  made  on  the  basis  of 
post-hoc  tests  of  the  significant  condition  by  time  interaction. 
It  would  take  the  form  of  "by  at  least  ms  X,  processing  of 
stimuli  in  condition  A  is  different  than  processing  of  stimulus 
in  condition  B".  This  kind  of  inference  is  frequently  made  in 
studies  of  selective  attention  where  an  important  theoretical 
issue  concerns  the  relative  time  at  which  an  attended  event 
receives  preferential  processing.  As  with  the  most  primitive 
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form  of  inference,  we  need  only  assume  that  the  ERP  is  a 
reflection  of  some  aspect  of  psychological  processing. 

3.6.3  Inference  3:  conditions  differ  with  respect  to  the 
latency  of  soae  process.  For  this  level  of  inference,  additional 
assumptions  and  measurement  operations  must  be  made.  First,  we 
must  assume  that  the  latency  of  a  particular  ERP  component  is 
related  to  the  latency  of  a  particular  psychological  process. 

For  example,  we  argued  previously  that  P300  does  not  occur  until 
the  evaluation  process  is  complete.  Second,  we  must  adopt  a 
procedure  to  identify  the  component  in  question  and  to  measure 
its  latency  (see  section  3).  Then,  we  use  an  analytic  procedure 
(analysis  of  variance,  t-test,  etc.)  to  evaluate  the  difference 
between  the  conditions  with  respect  to  the  component  latency.  As 
a  result  of  this  procedure,  we  make  the  inference  that  the 
conditions  differ  with  respect  to  the  timing  of  process  X. 

3.6.4  inference  4:  conditions  differ  with  respect  to  the 
degree  to  which  soae  process  occurs.  At  the  most  complex  level, 
we  can  use  ERPs  to  infer  that  a  particular  process  occurs  to  a 
greater  degree  under  one  condition  than  under  another.  In  this 
case,  we  must  assume  that  a  particular  ERP  component  is  a 
manifestation  of  process  X.  We  must  further  assume  that  changes 
in  the  magnitude  of  the  component  correspond  directly  to  changes 
in  the  degree  to  which  the  process  is  invoked.  Then,  we  must 
devise  a  suitable  measurement  procedure  to  identify  and  assess 
the  magnitude  of  the  component.  Finally,  we  can  proceed  with  the 
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usual  inferential  test  and  determine  whether  or  not  the 
conditions  differ  with  respect  to  the  degree  of  process  X. 


Event-Related  Brain  Potentials 

Page  33 


4.  A  Selective  Review  of  ERPs 

In  this  section,  we  consider  some  of  the  research  on  the 
functional  significance  of  a  variety  of  different  ERP  components. 
We  begin  with  a  discussion  of  ERPs  that  occur  prior  to  a  marker 
event.  This  is  followed  by  a  brief  overview  of  sensory  and 
cognitive  ERP  components  that  occur  after  the  marker  event. 

4.1  Event-preceding  potentials 

4.1.1  Movement-related  potentials.  One  class  of  event- 
preceding  potentials  includes  those  that  are  apparently  related 
to  the  preparation  for  movement.  These  potentials  were  first 
described  by  Kornhuber  and  Deecke  (1965)  who  found  that,  prior  to 
voluntary  movements,  a  negative  potential  develops  slowly, 
beginning  some  800  ms  before  the  initiation  of  the  movement  (see 
Figure  11) .  These  "Readiness  Potentials"  (or 
"Bereitschaftspotentials")  were  distinguished  from  those  that 
followed  the  movement,  the  "Reafferent  Potentials."  In  a 
condition  in  which  a  similar,  but  passive  movement  was  involved, 
only  post-movement  potentials  were  observed.  Both  readiness  and 
reafferent  potentials  tend  to  be  maximal  at  electrodes  located 
over  motor  areas  of  the  cortex.  Furthermore,  some  components  of 
the  potentials  are  larger  at  electrode  locations  contralateral  to 
the  responding  limb  (at  least  for  hand  and  finger  movements) . 
Indeed,  this  kind  of  lateralization  has  become  an  important 
criterion  for  movement-related  potentials. 
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The  investigation  of  movement-related  potentials  has 
developed  along  several  different  paths,  including  (a)  the 
discovery  and  classification  of  different  components  of  the 
movement-related  potential  (for  reviews,  see  Brunia,  Haagh,  & 
Scheirs,  1985,  and  Deecke  et  al.,  1984),  (b)  analysis  of  the 
neural  origin  using  the  scalp  topography  of  ERPs  (e.g. ,  Vaughan, 
Costa,  &  Ritter,  1972)  or  magnetic  field  recordings  (e.g.,  Okada, 
Williamson,  &  Kaufman,  1982;  Deecke,  Weinberg  &  Brickett,  1982), 
(c)  analysis  of  the  functional  significance  of  different 
components  (see  Brunia  et  al.,  1985,  and  Deecke  et  al.,  1984,  for 
reviews) ,  and  (d)  recording  of  movement-related  potentials  in 
special  populations  (e.g.,  in  mentally  retarded  children,  Karrer 
&  Ivins,  1976,  and  in  Parkinson's  patients,  Deecke,  Englitz, 
Kornhuber,  &  Schmidt,  1977).  in  general,  these  studies  confirm 
that  the  potential  described  by  Kornhuber  and  Deecke  is 
generated,  at  least  in  part,  by  neuronal  activity  in  motor  areas 
of  the  cortex  and  is  a  reflection  of  processes  related  to  the 
preparation  and  execution  of  movements. 

Recently,  movement-related  potentials  have  been  applied  to 
the  investigation  of  human  information  processing.  In 
particular,  we  have  used  the  measure  to  index  the  commitment  to  a 
specific  motor  response  in  choice  reaction-time  paradigms  (see 
Bosco  et  al.,  1986;  De  Jong,  wierda.  Mulder,  fc  Mulder,  in  press: 
Gehnng,  Gratton,  Coles,  fc  Donchin,  1986;  Gratton,  Coles, 
Sirevaag,  Eriksen,  i  Donchin.  in  press;  Kutas  6  Donchin,  198: 
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We  have  demonstrated  that  the  speed  and  accuracy  of  a  subject's 
reaction  time  response  is,  in  part,  related  to  the  degree  of 
prior  preparation  manifested  by  the  movement-related  potential. 

4.1.2  The  contingent-negative  variation  (CNV) .  The  CNV  was 
first  described  by  Walter,  Cooper,  Aldridge,  McCallum,  &  winter 
(1964)  as  a  slow  negative  wave  that  occurs  during  the  foreperiod 
of  a  reaction  time  task  (see  Figure  12) .  The  wave  tends  to  be 
largest  over  centre  1  (vertex)  and  frontal  areas.  Researchers 
investigating  the  functional  significance  of  CNV  have  manipulated 
several  aspects  of  the  S1-S2  paradigm,  including  the  subject's 
task,  the  discriminability  of  the  imperative  stimulus,  foreperiod 
duration,  stimulus  probability,  presence  of  distractors,  etc. 

The  component  has  been  variously  described  as  related  to 
expectancy,  mental  priming,  association,  and  attention  (for  a 
review,  see  Donchin,  Ritter,  &  McCallum,  1978;  Rohrbaugh  & 
Gaillard,  1983)  . 

A  central  controversy  in  research  in  this  area  over  the  past 
decade  concerns  whether  the  CNV  consists  of  one,  or  several, 
functionally  distinct,  components.  A  further,  but  related, 
question  is  whether  the  late  portion  of  the  CNV  (just  prior  to 
the  imperative  stimulus)  reflects  more  than  the  process  of  motor 
preparation  as  the  subject  anticipates  making  a  response  to  the 
imperative  stimulus.  This  controversy  was  raised  by  Loveless  and 
-■is  ro-wcrcers  e.g..  Loveless  fc  Sanford,  1974:  see  also,  Connor 
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&  Lang,  -1969)  who  argued  that  the  CNV  consists  of  two  components, 
an  early  orienting  wave  (the  O-wave)  and  a  later  expectancy  wave 
(the  E-wave) .  Subsequent  research  by  these  investigators  led 
them  to  argue  that  the  E-wave  is  a  readiness  potential  and 
reflects  nothing  more  than  motor  preparation.  Research  by 
Rohrbaugh  et  al.  (1976)  and  by  Gaillard  (1978;  see  also  Rohrbaugh 
&  Gaillard,  1983)  also  supports  this  interpretation.  However, 
the  question  of  the  functional  significance  of  the  latter 
component  (the  E-wave)  remains  controversial.  Some  investigators 
have  claimed  that,  because  a  late  E-wave  is  evident  even  in 
situations  in  which  no  overt  motor  response  is  required,  the  E- 
wave  has  a  significance  over  and  above  that  of  motor  preparation. 
However,  it  is  clear  that  even  though  the  overt  motor  response 
requirement  may  be  removed  from  these  situations,  attention  to  a 
stimulus  necessarily  involves  some  motor  activity  associated  with 
adjustment  of  the  sensory  apparatus.  Perhaps  the  most  persuasive 
arguments  for  a  non-motor  role  for  the  late  CNV  comes  from  a 
recent  study  of  Damen  and  Brunia  (1987).  These  authors  found 
evidence  for  a  motor- independent  wave  that  precedes  the  delivery 
of  feedback  information  in  a  time-estimation  task. 

4 . 2  Sensory  components 

The  presentation  of  stimuli  in  the  visual,  auditory,  or 
somatosensory  modality  elicits  a  series  of  voltage  oscillations 
that  can  be  recorded  from  scalp  electrodes.  In  practice,  sensory 
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potentials  can  be  elicited  either  by  a  train  of  relatively  high 
frequency  stimuli  or  by  transient  stimuli.  In  the  former  case, 
the  ERP  responses  to  different  stimuli  overlap  in  time.  The 
waveforms  have  quite  fixed  periodic  characteristics  that  are 
driven  by  the  periodic  stimulation,  and  are  therefore  referred  to 
as  "steady-state"  (see  Regan,  1972).  In  the  case  of  transient 
stimuli,  the  responses  from  different  stimuli  are  separated  in 
time. 

For  both  steady-state  and  transient  potentials  the 
potentials  appear  to  be  obligatory  responses  of  the  nervous 
system  to  external  stimulation.  In  fact,  the  earlier  components 
of  all  sensory  potentials  (within,  say,  100  ms)  are  invariably 
elicited  whenever  the  sensory  system  of  interest  is  intact.  In 
this  sense,  they  are  described  as  "exogenous"  potentials.  They 
are  thought  to  represent  the  activity  of  the  sensory  pathways 
that  transmit  the  signal  generated  at  peripheral  receptors  to 
central  processing  systems.  Therefore,  these  components  are 
"modality  specific,"  that  is,  they  differ  both  in  waveshape  and 
scalp  distribution  as  a  function  of  the  sensory  modality  in  which 
the  eliciting  stimulus  is  presented.  As  would  be  expected  of 
manifestations  of  primitive  sensory  processes,  the  sensory 
components  are  influenced  primarily  by  stimulus  parameters  such 
as  intensity,  frequency,  etc.  For  a  review  of  these  components, 
see  Hillyard,  Picton,  and  Regan  (1978). 
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For  clinical  purposes,  sensory  evoked  potentials  are  used  in 
the  diagnosis  of  neurological  diseases  (i.e.,  demyelinating 
diseases,  cerebral  tumors  and  infarctions,  etc.)*  Of  particular 
diagnostic  importance  are  the  auditory  potentials  (diseases 
involving  the  posterior  fossa) ,  and  the  steady-state  visual 
potential  (multiple  sclerosis) .  Auditory  potentials  can  also  be 
used  to  diagnose  hearing  defects  in  uncooperative  subjects. 

Since  most  sensory  potentials  appears  to  be  insensitive  to 
psychological  factors,  they  have  not  been  used  extensively  in  the 
study  of  psychological  processes.  We  should  note,  however,  that 
there  have  been  reports  that  some  of  the  middle-latency 
components  (between  10  and  100  ms  after  stimulus)  may  reflect 
selective  attention  (McCallum,  Curry,  cooper,  Pocock,  & 
Papakostopoulos,  1983;  Michie,  Bearpark,  Crawford,  &  Glue,  1987). 
The  relationship  between  the  N100  component  (in  the  visual, 
auditory,  and  somatosensory  modality)  and  attention  will  be 
reviewed  later  in  this  section. 

4.3  The  early  negativities 

Several  negative  components  have  been  described  in  the 
period  between  100  and  300  ms  after  the  presentation  of  an 
external  stimulus.  In  this  section,  we  will  examine  two  families 
of  negative  components  that  have  been  associated  with  selective 
attention  and  elementary  feature  analysis.  Although  these 
components  have  been  grouped  because  of  functional  similarities 
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and  latency  into  a  few  large  sub-groups  (NIOOs,  N200s,  etc.), 
their  scalp  distribution  and  morphology  vary  as  a  function  of  the 
modality  of  the  eliciting  stimulus.  Therefore,  these  potentials 
may  be  considered  to  lie  at  the  interface  between  "purely" 
exogenous  (described  in  section  4.2)  and  "purely"  endogenous 
components  (described  later  in  this  section) . 

4.3.1  The  HlOOs.  First  indications  that  ERPs  could  be  used 
to  investigate  attentional  processes  came  from  studies  in  which 
the  ERP  response  to  attended  stimuli  was  compared  to  that  to 
unattended  stimuli  (e.g.,  Eason  et  al,  1964;  Hillyard,  Hink, 
Schwent,  &  Picton,  1973).  These  kinds  of  studies  suggested  that 
attended  stimuli  are  associated  with  a  more  negative  ERP  between 
100  and  200  ms.  Subsequent  research  has  been  concerned  with  two 
issues:  (a)  the  use  of  ERPs  to  test  theories  of  selective 
attention,  and  (b)  the  nature  of  the  attentional  effect  on  ERPs. 

"Selective  attention"  refers  to  the  ability  of  the  human 
information  processing  system  to  selectively  analyze  some  stimuli 
and  ignore  others.  Two  metaphors  have  been  associated  with 
selective  attention,  that  of  filtering  (see  Broadbent,  1957)  and 
that  of  resources  (see  Kahnemann,  1973,  Norman  &  Bobrow,  1975). 
Filtering  theories  have  focussed  on  the  debate  about  the  locus  of 
the  filter:  does  filtering  occur  at  an  early,  perceptual  level 
(early  selection  theories,  Broadbent,  1957)  or  at  later  stages  of 
processing  (late  selection  theories,  Deutsch  6  Deutsch,  1963)? 
According  to  the  resource  metaphor,  selective  attention  is  a 
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mechanism  isy  which  the  system  allocates  more  resources  to  process 
information  coming  through  a  particular  attended  channel  than 
through  other  unattended  channels. 

In  a  typical  paradigm  (Hillyard,  Hink,  Schwent,  &  Picton, 
1973),  four  types  of  stimuli  are  presented.  The  stimuli  (tones) 
differ  along  two  dimensions  (location  and  pitch) ,  each  having  two 
levels  (left  vs.  right  ear  and  standard  vs.  deviant  pitch) .  The 
subject  is  instructed  to  attend  to  stimuli  at  a  particular 
location  and  to  detect  target  tones  of  a  deviant  pitch  (e.g., 
left  ear  tones  of  high  pitch) .  To  investigate  attention  effects, 
ERPs  to  standard  tones  occurring  in  the  attended  location 
(channel)  are  compared  to  those  to  standard  tones  in  the 
unattended  channel. 

Using  this  paradigm,  Hillyard  and  his  colleagues  have 
observed  a  larger  negativity  with  a  peak  latency  of  about  100-150 
ms  for  stimuli  presented  in  the  attended  channel  (see  Figure  13 
which  shows  data  from  a  similar  experiment  by  Knight,  Hillyard, 
Woods,  &  Neville,  1981).  The  moment  in  time  at  which  the 
waveforms  for  attended  and  unattended  stimuli  diverge  is 
considered  as  the  time  at  which  filtering  starts  playing  a  role. 
Hillyard  and  his  colleagues  originally  interpreted  their  data  in 
terms  of  a  modulation  of  the  sensory  N100  response  (Hillyard  et 
al.,  1973).  Thus,  they  considered  these  data  as  evidence  for 


early  selection. 
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Later  research  has  shown  that  the  difference  between  the  ERP 
waveforms  for  the  attended  and  unattended  channels  cannot  be 
considered  only  in  terms  of  the  amplitude  variation  of  a  peak 
with  a  latency  between  100  and  150  ms  (N100) .  Rather,  it  can  be 
characterized  by  the  superimposition  of  a  negative  component 
lasting  several  hundred  milliseconds  (Naatanen,  1982) .  This 
component  has  been  labelled  "Processing  Negativity".  The  onset 
latency  of  the  Processing  Negativity  is  related  to  the  difficulty 
of  the  discrimination  between  the  attended  and  the  unattended 
channel  (Hansen  &  Hillyard,  1983) .  The  Processing  Negativity 
interpretation  of  the  selective  attention  effect  is  consistent 
with  the  resource  metaphor.  Rather  than  indicating  a  filtering 
process  in  the  information  processing  flow,  the  Processing 
Negativity  might  reflect  a  selective  allocation  of  processing 
resources  to  the  attended  channel. 

Similar  variations  in  the  amplitude  of  ERP  components  with  a 
latency  of  100  to  200  ms  have  been  observed  for  visual  (Harter  & 
Aine,  1984)  and  somatosensory  (Desmedt  &  Robertson,  1977) 
stimuli.  In  these  modalities,  however,  the  difference  between 
the  attended  and  unattended  channels  is  characterized  more  by  an 
amplification  of  the  ERP  peaks  than  by  the  superimposition  of  a 
sustained  negativity  (Harter  &  Aine,  1984;  Michie,  Bearpark, 
Crawford,  &  Glue,  1987).  Again,  this  may  be  viewed  either  in 
terms  of  early  filtering  that  reduces  the  processing  of 
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irrelevant  information,  or  in  terms  of  extra  resources  devoted  to 
process  the  critical  information. 

This  review  is  necessarily  an  oversimplification  of  the 
complexity  of  ERP  we  have  loosely  referred  to  as  "N100" .  For 
example,  in  a  recent  review  Naatanen  and  Picton  (1987)  identified 
no  less  than  six  different  components  that  are  active  around  the 
time  of  N100! 

4.3.2  The  N200s.  While  the  amplitude  of  the  N100  (or  of 
the  Processing  Negativity)  appears  to  reflect  the  selection  of 
information  from  a  particular  perceptual  channel,  the  amplitude 
of  the  N200  component  reflects  the  detection  of  deviant  features. 
As  with  "N100,”  "N200"  is  used  to  refer  to  a  family  of 
components,  one  for  each  modality,  that  are  similar  in  function 
and  latency.  Thus,  different  N200s  can  be  observed  for  the 
visual  modality  (with  maximum  at  the  occipital  electrode)  and  for 
the  auditory  modality  (with  maximum  at  the  central  or  at  the 
frontal  electrode) . 

Squires,  Squires,  and  Hillyard  (1975)  manipulated  stimulus 
frequency  and  task  relevance  independently  and  found  that  the 
N200  was  larger  for  rare  stimuli,  regardless  of  their  task 
relevance.  A  similar  observation  has  been  made  by  Naatanen 
(1982)  in  the  kind  of  paradigm  used  by  Hillyard  and  his 
colleagues  that  we  described  in  the  previous  section.  Note  that, 
to  isolate  N200  effects,  the  comparison  of  interest  is  between 
the  rare  targets  and  the  frequent  non-targets,  rather  than 
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between  the  attended  and  the  unattended  channels  (as  is  the  case 
with  MOO  effects).  N200s  will  be  observed  to  rare  stimuli 
presented  on  either  channel:  therefore  the  N200  does  not  require 
selective  attention  for  its  appearance  (Squires,  Squires  & 
Hillyard,  1975) 3 .  Not  only  does  the  amplitude  of  the  N200  depend 
on  stimulus  probability,  the  latency  of  the  N200  component  is 
dependent  on  the  difficulty  of  the  discrimination  between  target 
and  non-target  stimuli  (Naatanen,  1982) .  Furthermore,  the 
amplitude  of  the  N200  is  also  proportional  to  the  difference 
between  frequent  and  rare  stimuli  (see  Figure  14) .  Therefore, 
Naatanen  proposed  that  the  N200  reflects  che  operation  of  an 
automatic  "mismatch  detector,"  and  he  labelled  this  component 
"mismatch  negativity."  Since  the  N200  appears  to  be  related  to 
the  automatic  detection  of  surprising  (rare)  events,  this 
component  has  been  related  to  the  "orienting  reflex"  (see 
Naatanen  &  Gaillard,  1983) .  Furthermore,  since  the  N200  is 
related  to  the  automatic  processing  of  rare  features,  it  may  be  a 
reflection  of  the  automatic  stage  of  feature  analysis  proposed  by 
some  recent  theories  of  perception  (see  Treisman  &  Gelade,  1980) . 

The  N200  has  been  used  in  the  investigation  of  mental 
chronometry.  In  particular,  Ritter,  Samson,  Vaughan,  and  Macht 
(1982)  and  Renault  (1983)  have  observed  that  the  latency  of  this 

3  Note  that,  in  this  paradigm,  the  P300  is  only  elicited  by 
rare  targets  presented  on  the  attended  channel.  Thus,  the  P300 
is  only  elicited  by  task-relevant  stimuli,  while  the  N200  appears 
to  be  "automatically"  elicited  by  rare  stimuli  regardless  of 
their  task-relevance. 
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component  covaries  with  reaction  time.  The  high  correlation 
between  N200  latency  and  reaction  time  may  reflect  the  importance 
of  automatic  feature  discrimination  processes  (signalled  by  the 
N200)  in  determining  the  latency  of  the  overt  response.  However, 
the  subtraction  technique  used  by  Ritter  et  al.  (1982)  to  derive 
their  measures  of  N200  must  be  interpreted  with  caution,  since 
the  latencies  of  the  components  in  the  original  waveforms  differ. 
Furthermore,  motor  potentials,  which  are  characterized  by  a  large 
negativity,  will  also  covary  quite  strictly  with  reaction  times. 
Thus,  it  is  important  to  disambiguate  the  N200  component  from 
motor  potentials  when  the  former  component  is  used  in  the  study 
of  mental  chronometry. 

4.4  The  late  cognitive  components 

In  this  section,  we  review  a  sample  of  the  research  dealing 
with  two  major  endogenous  components,  the  P300  and  the  N400.  For 
reasons  of  space  we  do  not  discuss  in  detail  other  late 
components,  particularly  a  group  of  "Slow  Waves."  At  the  present 
time,  the  functional  significance  of  the  slow  waves  is  largely 
unknown.  However,  see  Sutton  and  Ruchkin  (1984)  and  the  research 
on  the  O-wave  (Section  4.2.2)  for  further  information. 

4.4.1  The  P300 .  In  Section  3.4,  we  reviewed  research  on 
the  antecedents  of  P300,  and  we  briefly  alluded  to  attempts  to 
evaluate  a  theory  of  P300  by  studying  its  consequences.  In  this 
section,  we  focus  on  a  series  of  studies  that  have  sought  to 
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investigate  the  relationship  between  P300  and  the  memorability  of 
events  that  elicit  it. 

As  we  noted  earlier  (Section  3.4),  unexpected  events  that 
are  relevant  to  the  subject's  task  elicit  large  P300s.  This  led 
Donchin  (1981;  Donchin  &  Coles,  in  press)  to  formulate  the 
"context  updating"  hypothesis  of  the  functional  significance  of 
P300.  This  hypothesis  allows  one  to  generate  predictions  about 
the  consequences  of  the  elicitation  of  a  large  P300  component. 

The  context  updating  hypothesis  assumes  that  the  elicitation  of  a 
P300  reflects  a  process  involved  in  the  updating  of 
representations  in  working  memory.  Rare  or  unexpected  events 
should  lead  to  an  updating  of  the  current  memory  schemas,  because 
only  by  so  doing  can  an  accurate  representation  of  the 
environment  be  maintained.  The  updating  process  may  involve  the 
"marking"  of  some  attribute  of  the  event  that  made  it 
"distinctive"  with  respect  to  other  events.  This  updating  of  the 
memory  representation  of  an  event  is  assumed  to  facilitate  the 
subsequent  recall  of  the  event,  by  providing  valuable  retrieval 
cues,  so  that  the  greater  the  updating  that  follows  an  individual 
event,  the  higher  the  probability  of  later  recalling  that  event. 
P300  amplitude  is  assumed  to  be  proportional  to  the  degree  of 
updating  of  the  memory  representation  of  the  event.  Therefore, 
as  the  updating  process  is  supposed  to  be  beneficial  to  recall, 
P300  amplitude  should  also  predict  the  subsequent  recall  of  the 
eliciting  event. 
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The'  relationship  between  P300  and  memory  has  been  tested  in 
various  paradigms.  For  example,  Karis,  Fabiani,  and  Donchin 
(1984)  recorded  ERPs  to  words  presented  in  series  that  contained 
a  distinctive  word  (an  "isolate")  (cf.  von  Restorff,  1933).  The 
isolation  was  achieved  by  changing  the  size  of  the  characters  in 
which  the  word  was  displayed.  As  is  well  documented  (Cimbalo, 
1978;  von  Restorff,  1933;  Wallace,  1965),  isolated  items  are 
better  recalled  than  are  comparable  non-deviant  items  (the  von 
Restorff  effect).  The  isolated  items,  being  rare  and  task¬ 
relevant,  can  be  expected  to  produce  large  P300s.  Thus,  we  could 
predict  that  the  recall  variance  would  be  related  to  the  very 
factors  that  are  known  to  elicit  and  control  P300  amplitude. 

Karis  et  al.  (1984)  found  that  the  magnitude  of  the  von  Restorff 
effect  depends  on  the  mnemonic  strategy  employed  by  the  subjects. 
Rote  memorizers  (i.e,,  subjects  who  rehearse  the  words  by 
repeating  them  over  and  over)  showed  a  large  von  Restorff  effect, 
and  poor  recall  performance,  relative  to  elaborators  (i.e., 
subjects  who  combine  words  into  complex  stories  or  images  in 
order  to  improve  their  recall).  For  all  subjects,  isolates 
elicited  larger  P300s  than  non-isolates.  For  rote  memorizers, 
isolates  subsequently  recalled  elicited  larger  P300s  on  their 
initial  presentation,  than  did  isolates  that  were  not  recalled. 
This  relationship  between  recall  and  P300  amplitude  was  not 
observed  in  elaborators  (see  Figure  15).  It  is  noteworthy  that 
the  amplitude  of  a  frontal-positive  slow  wave  was  correlated  with 
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subsequent  recall  in  the  elaborators,  suggesting  that  this 
component  may  be  related  to  the  degree  of  elaborative  processing. 

Karis  et  al.  (1984)  interpreted  these  data  as  evidence  that 
all  subjects  "noticed”  the  isolated  words  and  reacted  by  updating 
their  memory  representations  and  producing  large  P300s.  The 
differences  among  the  subjects  emerged  when  subjects  tried  to 
memorize  the  stimuli  by  using  different  types  of  rehearsal 
strategies.  These  strategies  interacted  with  the  retrieval 
processes:  when  subjects  used  rote  strategies,  changes  in  the 
stimulus  representation,  induced  by  the  isolation  and  manifested 
by  P300 ,  made  it  easier  to  recall  the  word.  For  the  elaborators, 
whose  recall  depended  on  the  networks  of  associations  formed  as 
the  series  were  presented,  the  effects  of  the  initial  memory 
activation  and  updating  manifested  by  P300  were  not  noticeable, 
because  they  were  overshadowed  by  the  more  powerful  elaborative 
processing  that  occurred  after  the  time  frame  of  P300. 

The  interpretation  of  the  data  of  the  Karis  et  al.  (1984) 
study  capitalized  on  different  strategies  used  by  different 
subjects.  The  hypothesis  that  the  relationship  between  P300 
amplitude  and  subsequent  recall  indeed  depends  on  the  mnemonic 
strategy  used  by  the  subject  was  tested  in  a  subsequent  study  in 
which  strategy  was  manipulated  by  instruction  on  a  within-subject 
basis  (Fabiani,  Karis,  &  Donchin,  1985;  see  also  Fox  &  Michie, 
1987)  .  Strategy  instructions  were  effective  in  manipulating  the 
performance  of  the  subjects.  When  instructed  to  use  rote 
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strategies,  subjects  recalled  fewer  words,  and  displayed  a  larger 
von  Restorff  effect,  than  when  they  used  elaborative  strategies. 
Analyses  of  the  ERPs  also  supported  our  predictions.  When 
subjects  were  instructed  to  use  rote  strategies,  the  P300s 
elicited  by  words  subsequently  recalled  were  significantly  larger 
than  those  elicited  by  words  subsequently  not  recalled.  Such  a 
relationship  was  not  observed  when  subjects  used  elaborative 
strategies.  In  addition,  when  rote  strategies  were  used, 
subjects  recalled  the  size  of  the  words  better  than  in  the 
elaborative  condition.  This  suggests  that  size  is  a  "distinctive" 
attribute  of  the  memory  representation  of  the  word,  such  as  to 
help  recall  in  the  case  of  rote  instructions. 

In  another  experiment,  Fabiani,  Karis,  and  Donchin  (1986a) 
employed  an  incidental  memory  paradigm  (an  "oddball"  task)  to 
reduce  the  use  of  elaborative  strategies.  The  results  confirmed 
the  prediction  that  stimuli  that  were  subsequently  recalled  had 
larger  P300s  at  the  time  of  initial  presentation  than  those  for 
stimuli  that  were  not  recalled. 

All  the  studies  described  above  use  either  the  von  Restorff 
paradigm  or  the  oddball  paradigm.  However,  the  memory  effect  is 
not  limited  to  isolated  or  rare  items.  It  is  also  present  for 
the  non-isolates  and  for  the  frequent  items  in  the  oddball 
experiment. 

In  several  recent  studies,  memory  paradigms  have  been  used 
that  do  not  capitalize  on  stimuli  for  which  the  P300  is  expected 
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to  be  enhanced,  that  is,  paradigms  in  which  neither  the 
distinctiveness  nor  the  probability  of  occurrence  of  the  stimuli 
to  be  memorized  are  manipulated,  A  seminal  study  in  this 
respect  is  that  by  Sanquist,  Rohrbaugh,  Syndulko  and  Lindsley 
(1980)  which  found  that  larger  amplitude  P300s  (or  late  positive 
components)  were  elicited,  in  a  same-different  judgment  task,  by 
stimuli  that  were  correctly  recognized  in  a  subsequent 
recognition  test.  Johnson,  Pfefferbaum,  and  Kopell  (1985) 
recorded  ERPs  in  a  study- test  memory  paradigm.  They  reported 
that  the  P300  associated  with  subsequently  recognized  words  was 
slightly,  but  not  significantly,  larger  than  that  elicited  by 
non-recognized  words.  Paller,  Kutas,  and  Mayes  (1985)  recorded 
ERPs  in  an  incidental  memory  paradigm  in  which  the  subjects  were 
asked  to  make  either  a  semantic  or  a  nonsemantic  decision,  and 
were  subsequently,  and  unexpectedly,  tested  for  their  recognition 
or  recall  of  the  stimuli.  They  found  that  ERPs  elicited  during 
the  decision  task  were  predictive  of  subsequent  memory 
performance,  in  that  the  P300  (late  positive  complex)  was  larger 
for  words  subsequently  recalled  or  recognized  than  to  words  not 
recalled  or  recognized.  Similarly,  Paller,  McCarthy,  and  Wood 
(1987)  recorded  ERPs  in  two  semantic  judgment  tasks,  which  were 
followed  by  a  free  recall  and  by  a  recognition  test.  ERPs  to 
words  later  remembered  were  more  positive  than  those  to  words 
later  not  remembered,  even  though  the  memory  effect  was  smaller 
for  recognition  than  for  recall.  Neville,  Kutas,  Chesney,  and 
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Schmidt  ■(  t9 8 6 )  recorded  ERPs  to  words  that  were  either  congruous 
or  incongruous  with  a  preceding  sentence  in  a  task  in  which 
subjects  were  asked  to  judge  whether  or  not  the  word  was 
congruent  with  the  sentence.  They  found  that  the  amplitude  of  a 
late  positive  component  (P650)  predicted  subsequent  recognition. 

Taken  together,  the  experiments  described  in  this  section 
suggest  that  the  P300,  as  well  as  other  late  components  of  the 
ERP,  can  have  an  important  role  in  illuminating  some  of  the 
cognitive  processes  that  accompany  stimulus  encoding,  and  that 
are  usually  opaque  to  traditional  techniques.  While  the  first 
group  of  studies  conducted  in  our  laboratory  has  focussed  on  the 
interaction  between  distinctiveness  and  rehearsal  strategies  in 
determining  the  ERP/memory  relationship,  the  other  studies  have 
shown  that  such  a  relationship  is  widespread  over  a  variety  of 
different  memory  paradigms.  It  is  important  to  emphasize  that 
memory  is  a  complex  phenomenon  that  can  be  influenced  by  a 
multitude  of  variables  and  that  can  be  probed  with  a  number  of 
different  tests.  Thus,  it  is  unlikely  that  a  single  component  of 
the  ERP  can  be  identified  as  the  "memory  component."  It  is  much 
more  likely,  and  indeed  more  interesting,  that  a  series  of  ERP 
components  will  prove  to  be  significant  in  different  memory 
tasks. 

4.4.2  The  M400 .  The  N400  component  of  the  ERP  was  first 
described  by  Kutas  and  Hillyard  (1980a),  who  recorded  ERPs  in  a 
sentence  reading  task.  In  this  paradigm,  words  are  presented 
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serially  and  the  subject  is  asked  to  read  them  silently  in  order 
to  answer  questions  about  the  content  of  the  sentence  at  the  end 
of  the  experiment.  In  two  studies  reported  by  Kutas  and  Hillyard 
(1980a),  25%  of  the  sentences  ended  with  a  semantically 
incongruous  (but  syntactically  correct)  word.  These  incongruous 
words  elicited  an  N400  component  that  was  larger  than  that 
elicited  by  words  that  were  congruous  with  respect  to  the  meaning 
of  the  sentence.  Furthermore,  the  amplitude  of  the  N400  appeared 
to  be  proportional  to  the  degree  of  incongruity:  moderately 
incongruous  words  ("he  took  a  sip  from  the  waterfall")  had  a 
smaller  N400  than  strongly  incongruous  words  ("he  took  a  sip  from 
the  transmitter").  Kutas  and  Hillyard  (1982)  reported  that  the 
N400  to  incongruous  endings  was  slightly  larger  and  more 
prolonged  over  the  right  than  the  left  hemisphere. 

These  results  have  been  replicated  and  extended  repeatedly, 
using  variations  of  the  sentence  reading  paradigm  described 
above.  The  aim  of  these  studies  has  been  to  determine  whether 
the  N400  is  a  manifestation  of  a  distinctively  semantic  process, 
or  whether  it  is  elicited  by  other  kinds  of  deviance.  Kutas  and 
Hillyard  (1984)  found  that  the  amplitude  of  the  N400  was 
inversely  related  to  the  subject's  expectancy  of  the  terminal 
word  (cloze  probability) ,  while  it  was  insensitive  to  sentence 
constraints  (i.e.,  to  the  number  of  possible  alternative 
endings).  Kutas  and  Hillyard  (1980b)  recorded  ERPs  to  the 
terminal  word  in  a  sentence.  This  word  could  be  semantically  or 
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physically  deviant,  or  not  deviant-  They  interpreted  their  data 
as  indicating  that  an  N400  followed  semantic  deviation,  while  a 
late  positive  complex  (P300)  followed  physical  deviation4.  In 
addition,  Kutas  and  Hillyard  (1983)  inserted  a  number  of  semantic 
and  grammatical  anomalies  in  prose  passages.  They  found  that 
large  N400s  were  associated  with  the  semantic  anomalies  embedded 
in  the  text,  while  the  negativities  recorded  to  grammatical 
errors  were  inconsistent  and  had  a  scalp  distribution  different 
from  that  of  the  N400.  Kutas,  Lindamood,  &  Hillyard  (1984)  found 
that  anomalous  words  that  were  semantically  related  to  the 
sentence's  "best  completion'1  ("the  pizza  was  too  hot  to  drink") 
elicited  smaller  N400s  than  anomalous  words  unrelated  to  the  best 
completion  ("the  pizza  was  too  hot  to  cry").  This  suggests  that 
the  degree  of  semantic  relatedness  is  an  important  determinant  of 
the  N400  (see  Figure  16) . 

A  large  N400  component  is  also  evoked  by  semantic  anomalies 
presented  in  the  auditory  modality  (McCallum,  Farmer,  &  Pocock, 
1985) ,  or  in  anomalies  embedded  in  American  Sign  Language  (ASL) 
gestures  (Neville,  1985)  .  However,  Kacar  and  Besson  (1987)  did 
not  find  N400  responses  to  anomalous  endings  of  musical  tunes. 

N400-like  components  have  also  been  recorded  in  paradigms 
other  than  sentence  reading.  For  example,  Fischler,  Bloom, 

4  We  should  note  that  there  is  some  controversy  concerning 
whether  or  not  a  P300  follows  the  occurrence  of  an  N400  in  the 
case  of  semantic  deviation.  For  present  purposes,  it  is 
sufficient  to  note  that  the  physical  deviation  was  na£  followed 
by  an  N400. 
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Childers,  -  Roucos,  and  Perry  (1983)  recorded  ERPs  in  a  sentence 
verification  paradigm.  In  this  paradigm,  sentences  are  presented 
in  segments  ("a  robin  /  is  /  a  bird"),  and  two  dimensions  are 
orthogonally  manipulated:  whether  the  sentences  are  positive  or 
negative  ("is,"  "is  not"),  and  whether  they  are  true  or  false. 

The  subject  is  required  to  indicate  whether  the  sentence  is  true 
or  false.  A  large  negativity  was  elicited  by  false  affirmative 
("a  robin  /  is  /  a  tree")  and  true  negative  ("a  robin  /  is  not  / 
a  tree")  sentences,  that  is  by  sentences  in  which  the  first  and 
last  element  were  semantically  unrelated.  In  a  similar  study 
(Fischler,  Childers,  Achariyapaopan,  &  Perry,  1985)  subjects  were 
required  to  learn  the  occupations  of  fictitious  people.  They 
were  then  asked  to  indicate  whether  sentences  of  the  type  "John  / 
is  /  a  dentist"  were  true  or  false.  They  found  that  false 
statements  were  associated  with  larger  negativities  than  true 
statements. 

Rugg  recorded  N400-like  components  in  a  semantic  priming 
paradigm  (Rugg,  1985)  and  in  a  rhyme  judgment  task  (Rugg  & 
Barrett,  submitted) ,  although  there  are  some  differences  between 
the  scalp  distribution  of  the  negativity  recorded  in  these 
paradigms  and  that  recorded  in  the  sentence  reading  task. 

In  general,  research  reviewed  in  this  section  suggests  that 
there  is  a  brain  response  (the  N400)  that  is  specifically 
sensitive  to  the  violation  of  semantic  expectancies.  Measures  of 
this  component,  then,  should  prove  useful  in  testing  theories  and 
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models  relating  to  semantic  priming  (e.g.  Van  Petten  &  Kutas, 

I 

1987)  . 
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S.  Potential  Applications  of  ERP  Research 

In  this  section  we  consider  the  possible  contribution  of  ERP 
research  to  problems  that  arise  in  various  branches  of 
psychology,  with  a  particular  emphasis  on  social  psychology. 

Before  we  review  some  specific  research  examples,  we  need  to 
consider  two  general  issues  related  to  the  application  of  ERP 
research . 

First,  as  we  noted  earlier,  for  some  ERP  components  there  is 
sufficient  knowledge  about  their  Physiological  significance  for 
them  to  be  used  as  markers  of  physiological  functioning.  This 
knowledge  has  led  to  the  use  of  ERPs  in  neurological  diagnosis 
(see,  for  example,  Halliday,  1987  for  a  review) .  Second,  we 
have  devoted  much  of  this  chapter  to  a  discussion  of  the 
psychological  significance  of  ERP  components  -  that  is,  we  have 
attempted  to  show  that  the  components  are  manifestations  of 
particular  cognitive  activities.  One  implication  of  this  is 
that,  to  the  extent  that  questions  that  arise  in  any  branch  of 
psychology  can  be  recast  in  terms  of  questions  about  cognitive 
activities,  there  is  at  least  the  possibility  that  the 
measurement  of  ERPs  will  be  useful. 

In  the  following  sections,  we  review  several  examples  of  how 
such  research  has  proceeded  or  might  proceed.  For  reviews  of 
other  applications  of  ERP  research  to  cognitive  and  engineering 
psychology,  see  Donchin,  Karis,  Bashore,  Coles,  and  Gratton 
(1986)  and  Donchin,  Kramer,  and  Wickens  (1986). 
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S.l  ERPs  and  emotion 

5.1.1  Emotion  and  cognition.  In  previous  sections  of  this 
chapter,  we  have  emphasized  the  view  that  ERPs  can  be  regarded  as 
manifestations  of  cognitive  processes.  For  the  most  part,  the 
ERP  literature  contains  few  studies  of  ERP-emotion  relationships. 
This  tendency  to  emphasize  the  cognitive  rather  than  the 
emotional  is  probably  due  to  the  belief  that  the 
psychophysiological  analysis  of  emotions  is  more  the  province  of 
students  of  the  autonomic  nervous  system.  However,  recent 
attempts  to  re-evaluate  the  distinction  between  cognition  and 
emotion  call  into  question  the  justification  for  their 
psychophysiological  separation.  For  example,  if  the  experience 
of  emotion  is  attributed,  at  least  in  part,  to  the  process  of 
cognitive  appraisal  (e.g.  Arnold,  1960) ,  then  the  so-called 
"cognitive"  ERPs  have  the  potential  to  be  useful  in  the  analysis 
of  appraisal  processes.  Similarly,  if  emotions  are  believed  to 
influence  cognitive  processes  (e.g.,  Bower,  1981),  it  may  be  that 
ERPs  will  be  useful  in  describing  such  an  influence. 

These  kinds  of  ideas  are  behind  the  recent  analysis  of  ERPs 
in  situations  in  which  subjects  anticipate  and  receive  emotional 
stimuli  (Johnston,  Hiller,  &  Burleson,  1986;  Klorman  &  Ryan, 

1980;  Simons,  MacMillan,  &  Ireland,  1982)  and  in  studies  of 
subjects  suffering  from  emotional  dysfunction  (Miller,  1986;  Yee 
&  Miller,  in  press) .  In  the  following  sections,  we  review  two 
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other  kinds  of  studies  in  which  there  is  evidence  of  concern  with 
cognitive-emotion  issues. 

5.1.2  Lie  detection.  In  preceding  sections,  we  have 
emphasized  that  ERPs  can  be  used  to  explore  situations  involving 
emotion,  if  these  situations  can  be  viewed  in  terms  of  the 
information  processing  activities.  In  the  case  of  lie  detection, 
studies  using  autonomic  measures  have  similarly  emphasized  the 
importance  of  cognitive  factors.  Thus,  rather  than  propose  a 
specific  emotional  "lie-response",  these  studies  have  sought  to 
use  the  measures  to  determine  whether  an  individual  has  specific 
knowledge  (cf.  the  guilty-knovledge  test,  Lykken,  1974;  Podlesny 
&  Raskin,  1977)  . 

In  the  case  of  studies  of  Lie  Detection  using  ERPs, 
researchers  have  used  a  paradigm  in  which  stimulus  words  or 
phrases  relating  to  a  crime  would  be  categorized  in  one  way  if 
the  information  they  represented  was  unknown  to  the  individual  - 
but  in  another  way,  if  the  information  was  known  (see  Farwell  & 
Donchin,  1986) .  The  role  of  the  ERPs,  then,  is  to  identify  the 
categorization  rule  being  used  by  the  subject.  This  is 
accomplished  by  using  a  variant  of  the  "oddball"  task.  Three 
classes  of  stimuli  are  presented:  (a)  words  or  phrases  related  to 
the  crime  (the  probe  stimuli) ;  (b)  words  of  phrases  unrelated  to 
the  crime  and  unknown  to  the  subject  (irrelevant  stimuli);  and 
(c)  words  or  phrases  unrelated  to  the  crime,  but  which  are  known 
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to  the  subject  (target  stimuli).  The  subject  is  instructed  to 
count  the  target  stimuli. 

Thus,  the  critical  question  is  whether  the  subject  will 
categorize  the  probe  stimuli  as  irrelevant  (indicating  no 
knowledge)  or  as  target  stimuli  (indicating  knowledge).  This 
question  is  answered  by  setting  up  the  probabilities  of  the 
different  stimuli  such  that  the  probe  stimuli  would  elicit  a 
large  response  (P300)  if  they  are  categorized  as  targets. 

Farwell  and  Donchin  (1986)  have  shown  that,  if  the  probabilities 
of  irrelevant,  probe  and  target  stimuli  are  .67,  .17,  and  .16 
respectively,  the  procedure  is  effective  in  determining  the 
categorization  rule  used  by  the  individual.  In  this  way,  the 
presence  of  guilty  knowledge  can  be  determined. 

In  this  example,  the  role  of  the  ERP  is  to  determine  whether 
a  particular  set  of  stimuli  is  classified  by  the  subject  as 
belonging  to  one  of  two  other  categories.  In  principle,  then,  it 
could  be  used  in  any  situation  in  which  the  experimenter  is 
interested  in  identifying  classification  rules. 

5.1.3  Bargaining.  Bargaining,  like  most  forms  of  social 
interaction,  involves  both  cognitive  and  emotional  processes. 
Thus,  bargaining  presents  an  interesting  topic  of  study  for  the 
analysis  of  cognitive-emotional  relationships. 

In  the  study  to  be  described  in  this  section  (Karis, 
Druckman,  Lissak,  &  Donchin,  1984),  the  focus  was  on  the 
cognitive  ERP  responses  to  events  believed  to  have  emotional 
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consequences.  The  subject  was  engaged  in  a  "bargaining"  session 
with  the  computer.  The  object  of  bargaining  was  the  purchase  of 
a  used  car.  The  subject  and  the  computer  alternated  in  making 
their  concessions,  until  they  agreed  on  a  price.  The  subject's 
objective  was  to  buy  the  car  at  the  lowest  possible  price.  ERPs 
were  recorded  while  the  subject  viewed  the  computer's  concession. 

An  interesting  aspect  of  this  study  was  that  the  computer 
adopted  either  one  of  two  strategies  in  making  its  concessions: 
it  could  be  either  "generous"  or  "stingy."  When  the  computer  was 
in  the  generous  mode  it  would  concede  80%  of  the  subject 
concession,  while  when  it  was  in  the  stingy  mode  it  would  concede 
only  20%  of  the  subject  concession.  Thus,  it  was  important  to 
the  subject  to  guess  the  computer's  strategy,  in  order  to  take 
advantage  of  it. 

Switches  in  the  computer's  strategy  (which  occurred 
randomly)  reliably  elicited  P300s  that  were  larger  than  those 
recorded  to  trials  in  which  the  strategy  remained  the  same.  This 
can  be  viewed  in  terms  of  cognitive  processes:  To  the  extent  that 
P300  represents  a  process  of  context  updating,  it  seems 
reasonable  that  the  subject  would  need  to  update  his/her  mental 
schema  at  the  moment  in  which  a  task  relevant  event  changes. 
Another  interesting  aspect  of  this  study  is  that  a  photograph  of 
the  subject's  face  was  taken  every  time  the  subject  saw  the 
computer's  concession.  The  facial  expression  of  the  subject  was 
then  analyzed  in  terms  of  the  emotions  displayed.  It  is 
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interesting  to  note  that,  on  some  occasions,  the  subject  smiled 
when  the  concession  was  generous.  Thus,  the  appraisal  of  a 
concession  (as  manifested  by  the  ERP)  was,  in  some  cases, 
associated  with  the  facial  expression  of  emotion. 

5.2  ERPs  and  "pay-off"  manipulations 

The  social  interaction  between  experimenter  and  subject  can 
have  an  influence  on  the  strategy  adopted  by  the  subject  in 
performing  his  or  her  assigned  task.  Subjects  approach 
experimental  situations  with  hypotheses  about  the  goals  of  the 
experiment  and,  most  importantly,  the  pay-offs  associated  with 
different  goals.  Information  about  these  pay-offs  can  be  deri\ad 
from  explicit  instructions  as  well  as  more  implicit  forms  of 
communication  between  subject  and  experimenter. 

In  most  ERP  studies,  we  try  to  constrain  the  subject's  task 
so  as  to  reduce  the  options  available  to  the  subject.  In  this 
way,  assumptions  can  be  made  about  the  processes  used  by  the 
subject  to  perform  the  task. 

In  several  studies,  instructions  or  pay-off  schedules  have 
been  deliberately  manipulated  in  order  to  alter  the  subject's 
strategy  in  performing  the  task.  In  this  case,  the  ERP  focus  is 
on  understanding  how  strategic  changes  are  implemented.  An 
example  of  this  approach  has  been  described  in  section  4.4.1 
where  we  saw  that  mnemonic  strategies,  manipulated  by 
instruction,  were  important  determinants  of  the  P300/recall 
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relationship  and,  by  implication,  of  the  role  of  the  rehearsal  of 
distinctiveness  cues  in  memory. 

In  the  following  two  sections,  we  review  other  examples  of 
this  kind  of  approach. 

5.2.1  Decision-making.  The  way  in  which  humans  reach 
decisions  is  an  important  problem  for  social  psychologists.  Most 
theories  of  decision-making  agree  that  subjective  probability  and 
pay-off  structure  are  fundamental  determinants  of  human  choices 
(Fishbein  4  Ajzen,  1975). 

Karis,  Chesney,  and  Donchin  (1983)  investigated  the  role  of 
these  two  dimensions  in  determining  the  amplitude  of  P300.  In 
this  study,  subjects  had  to  predict  which  of  three  stimuli  (the 
numbers  1,  2,  or  3)  would  next  occur.  The  probability  of  the 
three  stimuli  were  .45,  .10,  and  .45,  respectively.  The  subjects 
were  rewarded  for  correct  predictions  following  one  of  two 
schedules,  run  in  different  blocks.  In  one  schedule 
("all-or-none") ,  subjects  were  only  rewarded  for  correct 
predictions.  In  another  pay-off  schedule  ("linear”),  subjects 
were  not  only  rewarded  for  correct  choices,  but  also  received 
half  bonuses  when  their  predictions  were  off  by  one  (i.e.,  a  l 
occurred  when  a  2  was  predicted) .  Thus,  in  the  "linear" 
schedule,  the  optimal  strategy  was  to  predict  2  (objectively  the 
rare  event) . 

Karis  et  al.  found  that  although  subjects  did  indeed  vary 
their  overt  predictions  following  the  pay-off  schedules,  their 
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P3 00s  ware  always  larger  for  the  rare  stimuli  (the  2s) .  That  the 
subjects  were  aware  of  the  probability  structure  in  each  of  the 
two  conditions  was  confirmed  by  subjective  reports.  These 
results  indicate  that  P300  reflects  subjective  probability  rather 
than  a  conscious  decision  about  which  stimulus  was  most 
rewarding. 

Karis  et  al.  also  studied  variations  in  subject's  strategy. 

In  particular  they  were  able  to  quantify  the  "riskiness"  of  a 
particular  choice,  by  estimating  the  expected  loss  with  respect 
to  the  optimal  behavior.  They  observed  that  P300  amplitude  was 
proportional  to  the  riskiness  of  the  prediction.  They 
interpreted  this  result  in  terms  of  differences  in  the  relevance 
of  the  information  conveyed  by  the  stimulus  in  cases  of  high  and 
low  risk  choices. 

5.2.2  Speed-accuracy  instructions.  Kutas,  McCarthy,  and 
Donchin  (1977)  analyzed  the  relationship  between  P300  and 
reaction  time,  when  subjects  were  given  speed  or  accuracy 
instructions.  While  the  timing  of  the  reaction  time  response  in 
relation  to  the  P300  changed  with  instruction,  the  latency  of  the 
P300  itself  was  only  slightly  affected.  Thus,  in  this  case,  the 
effect  of  instructions  was  to  change  the  coupling  between  the 
processes  associated  with  P300  and  those  associated  with  overt 
behavior.  In  terms  of  information  processing,  it  appears  that 
speed  instructions  lead  the  subject  to  generate  an  overt  response 
before  information  has  been  fully  processed. 
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A  similar  modulation  of  the  relationship  between  the 
processing  of  stimulus  information  and  the  timing  of  the  overt 
response  has  been  described  by  Gratton  et  al.  (in  press;  see  also 
Gehring  et  al.,  1986).  In  this  case,  measures  of  the  readiness 
potential  (see  section  4.1.1)  were  diagnostic  of  the  strategic 
mode  of  the  subject  (accurate  versus  fast) . 

5 . 3  ERPs  and  communication 

In  this  chapter,  we  have  considered  ERPs  as  a  tool  for 
investigating  cognitive  processing.  In  particular,  we  have 
emphasized  how  ERPs  can  be  used  to  obtain  information  about 
cognitive  functions  which  would  not  be  available  in  other  ways. 

In  somewhat  more  extreme  terms,  we  may  consider  ERPs  as  a 
communication  device. 

An  interesting  example  of  this  approach  is  given  by  a  recent 
study  by  Farvell,  Donchin,  &  Kramer  (1986).  These  investigators 
have  devised  a  technique  that  allows  people  to  communicate  by 
exploiting  the  relationship  between  ERPs  and  attention.  Subjects 
in  this  study  were  presented  with  a  matrix  of  letters  that  were 
illuminated  in  rapid  succession.  By  examining  the  ERP  responses 
to  each  of  the  flashes,  they  were  able  to  determine  the  letters 
to  which  the  subjects  were  attending.  Thus,  letter  by  letter, 
the  subjects  were  able  to  communicate  entire  words,  by  means  of 
their  ERPs.  This  type  of  communication  device  may  be 
particularly  useful  for  cases  in  which  people  cannot  communicate 
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in  other  ways,  such  as  in  cases  of  patients  that  have  lost  any 
motor  ability  as  a  consequence  of  trauma  and  disease. 

5.4  summary 

In  the  examples  reviewed  in  this  section,  we  have  seen  how 
ERPs  can  be  used  to  study  or,  in  one  case,  enhance  social 
behavior,  broadly  defined.  The  underlying  theme  of  these 
examples  has  been  that,  if  the  question  of  interest  can  be 
considered  in  terms  of  the  information  processing  transactions 
that  occur,  ERPs  may  be  a  useful  tool.  This  is  because  ERPs  can 
be  regarded  as  manifestations  of  information  processing 
activities. 

Upon  analysis,  there  appear  to  be  several  other  instances  in 
which  such  a  conception  is  quite  feasible,  such  as  when  the 
interest  of  the  investigator  is  on  the  cognitive  processing  of 
social  information.  For  example,  the  recent  development  of 
"social  cognition"  as  a  sub-discipline  within  social  psychology 
has  led  to  the  treatment  of  a  subset  of  "social  processes"  as 
involving  the  operation  of  the  cognitive  apparatus  on  social 
information  (see,  for  example,  Wyer  &  Srull,  1984).  The  only 
restriction  in  the  application  of  ERPs,  to  this  domain  is  that  the 
information  be  presented  discretely  so  that  segments  of  the  EEG 
can  be  identified  that  are  time-locked  to  the  events  of  interest. 

In  other  cases,  considerable  efforts  may  be  needed  in  order 
to  apply  the  model  of  cognitive  psychophysiology.  For  example, 
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evaluations  of  social  interactions  would  require  not  only  that 
the  interactions  be  structured  so  as  to  permit  discrete  events  to 
be  identified.  One  would  also  need  to  incorporate  repetitions  of 
the  "same”  event  into  the  interaction  (if  averaging  is  necessary 
to  extract  stable  ERPs)  and,  most  importantly,  to  identify  the 
putative  cognitive  processes  associated  with  the  events.  The 
imposition  of  this  kind  of  structure  may  result  in  a  distortion 
of  the  essence  of  the  interaction  process. 

On  the  basis  of  the  foregoing  examples  and  discussion,  there 
appears  to  be  good  reason  to  expect  considerable  benefit  from  an 
application  of  a  cognitive  psychophysiology  based  on  ERPs  in 
various  areas  within  psychology,  particularly  social  psychology. 
Just  as  problems  in  cognitive  psychology  are  proving  more 
tractable  when  psychophysiological  measures  are  added  to  those 
measures  traditionally  available,  problems  in  social  psychology 
may  prove  to  be  similarly  more  tractable.  However,  the  benefits 
of  such  an  application  will  only  accrue  if  the 
psychophysiological  measures  are  derived  under  appropriate 
circumstances  and  if  such  circumstances  can  be  created  without 
distortion  of  the  phenomenon  of  interest. 
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7 .  figure  Legends 

Figure  l.  schematic  representation  of  the  operations 
involved  in  the  recording  of  Event-Related  brain  Potentials. 

From  left  to  right:  (a)  Top  view  of  the  head,  indicating  the 
placements  of  five  electrodes  (Fz,  C3 ,  Cz ,  C4 ,  and  Pz)  from  which 
EEG  is  recorded  -  note  that  other  locations  are  also  frequently 
used;  (b)  the  EEG  signal  is  then  transferred  to  an  amplifying  and 
filtering  system;  (c)  the  amplified  and  filtered  signal  may  be 
stored  temporarily  on  an  analog  magnetic  tape;  (d)  the  analog 
signal  is  then  converted  into  a  digital  signal  by  sampling  the 
potential  at  a  high  frequency  (usually  at  least  100  Hz)  by  an 
Analog-to-Digital  converter;  (e)  the  digitally  transformed  signal 
may  be  stored  on  a  digital  store  device  (magnetic  tape  or  disc) ; 
(f)  finally,  ERPs  are  extracted  from  the  digitized  EEG  signal  by 
averaging  point-by-point  across  a  large  sample  of  trials  (more 
than  20) . 

Figure  2.  A  schematic  representation  of  ERP  components 
elicited  by  auditory,  infrequent  target  stimuli.  The  three 
panels  represent  three  different  voltage  x  time  functions:  the 
left  bottom  panel  shows  the  very  early  sensory  components  (with  a 
latency  of  less  than  10  ms) ;  the  left  top  panel  shows  the  middle 
latency  sensory  components  (with  a  latency  of  between  10  and  50 
ms) ;  and  the  right  panel  shows  late  components  (latency  exceeding 
50  ms)  .  Note  the  different  voltage  and  time  scales  used  in  the 
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three  panels,  as  well  as  the  different  nomenclatures  used  to 
label  the  peaks  (components).  (Copyright  1979,  Plenum  Publishing 

corporation.  Adapted  with  permission  of  the  author  and  publisher 
from  Donchin,  1979.) 

Figure  3.  Schematic  representation  of  configuration  of 
neurons  whose  simultaneous  polarization  does  or  does  not  result 
in  a  potential  detectable  by  a  distant  electrode.  The  electric 
fields  generated  by  polarization  of  neurons  organized  in  layers 
(such  as  those  shown  in  the  left  panel)  add  together  to  form  a 
powerful  field  that  can  be  detected  from  a  distant  (e.g.,  scalp) 
electrode  ("open  field").  The  fields  generated  by  neurons 
organized  concentrically  (such  as  those  shown  in  the  right  panel) 
cancel  each  other  to  produce  a  very  small  field  that  cannot  be 
detected  by  a  scalp  electrode  ("closed  field").  (Copyright  1947, 
Alan  R.  Liss,  Inc.  Reprinted  with  permission  of  the  author  and 
publisher  from  Lorente  de  Ho,  1947.) 

Figure  4.  ERPs  elicited  by  counted,  rare  tones  (upper 
panel).  The  data  recorded  with  four  different  high-pass  filter 
settings  ("time  constant")  are  superimposed.  Stimulus  occurrence 
is  indicated  by  an  S  on  the  time  scale.  Calibration  pulses 
(lower  panel)  are  plotted  on  the  same  voltage  x  time  scale  as  the 
ERPs.  Note  the  reduction  in  amplitude  and  deformation  of  the  ERP 
waveshape  produced  by  progressively  shorter  time  constants,  that 
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reduce  Tow  frequency  activity.  (Copyright  1979,  The  Society  for 
Psychophysiological  Research.  Reprinted  with  permission  of  the 
author  and  publisher,  from  Duncan-Johnson  &  Donchin,  1979.) 

Figure  5.  Schematic  representation  of  an  ERP  waveform, 
indicating  different  procedures  for  component  quantification. 
Three  types  of  peak  measures  are  indicated.  The  peak  latency  is 
obtained  by  measuring  the  interval  (in  ms)  between  the  external 
triggering  event  and  a  positive  or  negative  peak  in  the  waveform. 
The  base-to-peak  amplitude  measure  is  obtained  by  computing  the 
voltage  difference  (in  microvolts)  between  the  voltage  at  the 
peak  point  and  a  baseline  level  (usually  the  average  pre-stimulus 
level) .  The  peak-to-peak  amplitude  measure  is  obtained  by 
computing  the  voltage  difference  (in  microvolts)  between  the 
voltage  at  the  peak  point  and  the  voltage  at  a  previous  peak  of 
opposite  polarity.  The  area  measure  is  obtained  by  integrating 
the  voltage  between  two  timepoints. 

Figure  6.  Example  of  the  application  of  Principal 
Component  Analysis  (PCA)  to  the  study  of  ERPs.  A  series  of  ERP 
waveforms  (whose  grandaverage  is  plotted  in  the  upper  panel  of 
the  figure)  are  decomposed  in  several  constituent  components, 
whose  time  courses  (component  loadings)  are  shown  in  the  lower 
panel  of  the  figure.  The  component  loadings  are  then  used  to 
determine  the  degree  to  which  a  particular  component  is  present 
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in  each  of  the  ERP  waveforms.  (Copyright  1982,  Elsevier  Science 
Publishers.  Reprinted  with  permission  of  the  author  and 
publisher  from  Duncan-Johnson  &  Donchin,  1982.) 

Figure  7.  Example  of  the  application  of  Vector  Filtering 
to  the  study  of  ERPs.  Waveforms  from  three  electrode  locations, 
shown  in  the  upper  panel,  are  used  to  determine  the  contribution 
of  a  particular  component  (e.g.  P300) ,  which  is  characterized  by 
a  specific  profile  of  amplitudes  at  the  different  electrode 
locations.  The  result  of  this  operation  is  shown  in  the  lower 
panel . 


Figure  8.  Grand-average  ERP  waveforms  at  Pz  from  10 
subjects  for  counted  (high  -  left  column)  and  uncounted  (low  - 
right  column)  stimuli  (tones) ,  with  different  a  priori 
probabilities.  The  probability  level  is  indicated  by  a 
percentage  value  beside  each  waveform.  Waveforms  from  a 
condition  in  which  the  subjects  were  instructed  to  ignore  the 
stimuli  are  are  also  presented  for  a  comparison.  The  occurrence 
of  the  stimulus  is  indicated  by  a  black  bar  on  the  time  scale. 
Positive  voltages  are  indicated  by  downward  deflections  of  the 
waveforms.  Note  that  P300  amplitude  is  inversely  proportional  to 
the  probability  of  the  eliciting  stimulus  ("probability  effect") , 
and,  at  the  same  probability  level,  P300  is  larger  for  counted 
than  uncounted  stimuli  ("target  effect").  (Copyright  1977,  The 
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Society  -for  Psychophysiological  Research.  Reprinted  with 
permission  of  the  author  and  publisher  from  Duncan-Johnson  & 
Donchin,  1977 .  ) 

Figure  9.  ERP  waveforms  at  Pz  averaged  across  subjects  for 
three  different  semantic  categorization  tasks.  The  solid  line 
indicates  ERPs  obtained  during  a  task  in  which  the  subjects  had 
to  distinguish  between  the  word  DAVID  and  the  word  NANCY  (the  FN 
condition) .  The  dotted  line  indicates  ERPs  obtained  during  a 
task  in  which  the  subjects  had  to  decide  whether  a  word  presented 
was  a  male  or  a  female  name  (the  VN  condition) .  The  dashed  line 
indicates  ERPs  obtained  during  a  task  in  which  the  subjects  had 
to  decide  whether  a  word  was  or  was  not  a  synonym  of  the  word 
PROD  (SYN  condition) .  These  three  tasks  were  considered  to 
involve  progressively  more  difficult  discriminations.  Note  the 
latency  of  P300  peak  is  progressively  longer  as  the 
discrimination  is  made  more  difficult.  (Copyright  1977,  The 
AAAS.  Adapted  with  permission  of  the  author  and  publisher  from 
Kutas,  McCarthy,  6  Donchin,  1977.) 

Figure  10.  Accuracy  of  reaction, time  responses  given  at 
different  latencies  ("speed-accuracy  functions")  for  trials  with 
"fast"  and  "slow"  P300.  Response  latency  (defined  in  terms  of 
the  onset  of  the  EMG  response)  is  plotted  on  the  abscissa.  The 
probability  that  a  response  would  be  correct  is  plotted  on  the 
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ordinate'.  Note  that  the  probability  of  giving  a  correct  response 
increases  as  response  latency  increases.  At  very  short  response 
latencies,  responses  are  at  a  chance  level  of  accuracy  (.5).  At 
long  response  latencies,  responses  are  usually  accurate.  The 
speed/accuracy  function  for  those  trials  with  P300  latency 
shorter  than  the  median  latency  ("fast  P300"  trials)  are 
indicated  by  solid  lines.  The  speed/accuracy  function  for  those 
trials  with  P300  latency  longer  the  median  latency  ("slow  P300" 
trials)  are  indicated  by  dashed  lines.  Note  that,  for  each 
response  latency,  the  probability  of  giving  a  correct  response  is 
higher  when  the  P300  on  that  trial  (reflecting  the  speed  of 
stimulus  processing  on  that  trial)  is  fast  than  when  it  is  slow. 
(Copyright  1985,  The  American  Psychological  Association. 

Reprinted  with  permission  of  the  publisher  from  Coles,  Gratton, 
Bashore,  Eriksen,  &  Donchin,  1985.) 

Figure  11.  Typical  movement  related  potential  (recorded 
from  a  central  electrode  -  Cz)  preceding  a  voluntary  hand 
movement.  Note  that  the  potential  begins  about  1  sec  before  the 
movement  (indicated  by  the  dashed  vertical  line).  The  potential 
can  be  subdivided  into  different  components  as  follows:  N1  (RP  - 
Readiness  Potential,  BSP  -  Bereitschaftspotential)  ,*  N2  (MP  - 
Motor  Potential) ;  and  the  P2  (RAF  -  Reafferent  Potential) . 
(Copyright  1980,  Elsevier  Science  Publishers.  Adapted  with 
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permission  of  the  author  and  publisher  from  Kutas  &  Donchin, 

1980. ) 

Figure  12.  Schematic  representation  of  a  typical  contingent 
negative  variation  (CNV)  recorded  from  Cz.  The  CNV  is  the 
negative  portion  of  the  wave  between  the  presentation  of  the 
warning  and  imperative  stimuli.  The  early  portion  of  the  CNV  is 
labelled  "0-wave"  (or  Orienting  wave) ,  while  the  late  portion  is 
labelled  "E-wave"  (or  Expectancy  wave).  (Copyright  1983, 

Elsevier  Science  Publishers.  Adapted  with  permission  of  the 
author  and  publisher  from  Rohrbaugh  &  Gaillard,  1983.) 

Figure  13.  The  effect  of  attention  on  early  components  of 
the  auditory  event-related  potential  recorded  at  the  central 
electrode  (Cz) .  The  left  panel  shows  ERPs  for  tones  presented  in 
the  left  ear.  Note  that,  the  difference  between  the  ERPs  to 
attended  tones  (solid  line)  versus  those  for  unattended  tones 
(dashed  line)  consists  of  a  sustained  negative  potential.  A 
similar  difference  can  be  seen  for  tones  presented  to  the  right 
ear  (see  right  panel) .  (Copyright  1981,  Elsevier  Science 
Publishers.  Adapted  with  permission  of  the  author  and  publisher 
from  Knight  et  al.,  1981.) 

Figure  14.  The  effects  of  deviance  on  "mismatch  negativity". 
A  standard  (80  db)  tone  was  presented  on  90%  of  the  trials  and  a 
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deviant  tdne  (57,  70  or  77  db,  in  different  blocks)  was  presented 
on  10%  of  the  trials.  The  ERP  to  the  standard  is  indicated  by 
the  thin  line  in  each  panel;  the  ERP  to  the  deviant  tone  is 
indicated  by  the  thick  line.  As  the  degree  of  mismatch  between 
stimuli  increases,  the  mismatch  negativity  also  increases.  (The 
magnitude  of  the  difference  between  standard  and  deviant  ERPs 
increases.)  (Copyright  1987,  The  Society  for  Psychophysiological 
Research.  Adapted  with  permission  of  the  author  and  publisher 
from  Naatanen  &  Picton,  1987.) 

Figure  15.  ERPs  elicited  by  "isolated"  words  that  were 
later  recalled  (solid  line)  or  not-recalled  (dashed  line) .  The 
left  column  shows  ERPs  for  subjects  who  used  rote  mnemonic 
strategies;  the  right  column  shows  ERPs  for  subjects  who  used 
elaborative  strategies.  Note  that  the  amplitude  of  P300  is 
related  to  subsequent  recall  for  the  rote  memorizers,  but  not  for 
elaborators.  (Copyright  1986,  Elsevier  Science  Publishers. 
Reprinted  with  permission  of  the  publisher  from  Karis,  Fabiani,  & 
Donchin,  1986b.) 

Figure  16.  The  effects  of  anomalous  sentence  endings  on  the 
N400.  The  ERPs  (from  Pz)  depicted  in  the  figure  were  recorded 
following  visual  presentation  of  words  that  varied  in  their 
relationship  to  the  previous  words  in  the  sentence.  For  example, 
for  sentences  such  as  "The  pizza  was  too  hot  to  ...”,  three 
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endings  were  possible.  Best  completion:  "eat";  Related  anomaly: 
"drink";  Unrelated  anomaly;  "cry".  Note  that  the  N400  component 
is  only  present  for  anomalies,  and  is  larger  for  unrelated  than 
for  related  anomalies.  (Copyright  1984,  Lawrence  Erlbaum 
Publishers.  Adapted  with  permission  of  the  author  and  publisher 
from  Kutas,  Lindamood,  &  Hillyard,  1984.) 
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I.  Introduction 

In  this  paper  vve  shall  contme  our  attention  to 
the  i ole  ol  theory  in  cognitive  psychophysiolo¬ 
gy  I  he  haste  assumption  ol  the  cognitive 
psw hophssioloeteal  approaeh  is  that  cognitive 
ivtiwu  is  implemented  in  the  nervous  system 
I',  nivalis  ol  physiological  changes.  Given  this 
issumption.  it  is  possible  that  a  variety  of 
jwv  .f.  .physiological  measures  may  prove  use- 
hi  the  evploiation  t>l  cognitive  lunetion. 
II  'uevet.  »v‘  shall  locus  our  attention  on 
■m  I'utes  ol  the  event-related  brain  potential 
I  K  I1 1  i  lot  an  example  ot  another  physiologic- 
■  system,  see  yan  der  Molen.  Somsen. 

.  n.vheke.  lo,ss| 

.  hay  v'  atgueil  previously  (see.  for  exam- 
t  ■ 'les  ,v  (iratton.  Ilhs5.  ldSti;  Donchin. 

|  .  -  ,X  t  itatton.  ITS4.  and  see  also  Donchin. 

■'ll  that  cognitive  psychophysiology  repre- 
r-  an  attempt  to  understand  human  cogni- 
. .  '..iKiion  in  w Inch  traditional  'cognitive' 

- , . t v s  ueavtion  time,  errot  rate,  etc.)  are 
n.  ni v d  by  psyehophystological  measures 
'..ne  to  understand  human  cognitive 
yyv  avlopt  a  general  information 
-uc  voneeption.  in  which  a  particular 
.  n.iy  i. u ul  output .  lollow mg  some  input. 

■  ,'L.twi  to  the  activities  ot  various 
i  c  piovcsses  Given  this  framework. 
\.n  uselul  to  try  to  account  tor 
' .11  behavioral  output  in  terms  of 
ai  a  subset  ol  the  intervening 
i  Hiiput  y .inability  may  be  observed 
■■  a  i  vt  m  m  I  us  condition ).  slate,  subject 
•■■■i.  ilitv  ace  or  subject  strategy 


vary.  In  seeking  to  account  for  output  variabil¬ 
ity.  we  propose  that  the  psyehophysiologual 
measures  are  sensitive  to  particular  aspects  ol 
human  information  processing  (the  interven¬ 
ing  processes'  mentioned  above)  It  is  our 
working  hypothesis  that  the  traditional  mea¬ 
sures  of  behavioral  output  depend  on  u  larger 
set  of  processes  than  do  particular  psychophy- 
siological  measures.  Thus,  psychophysiologicul 
measures  should  enable  us  to  localize  the 
source  of  output  variability .  We  should  empha¬ 
size  that,  while  our  strategy  has  been  to  try  to 
account  for  behavioral  output,  our  primary 
interest  is  more  general — that  is,  we  want  to 
know  how  the  information  processing  system 
works. 

Note  that  in  the  preceding  paragraph,  we 
have  explicitly  made  a  number  ot  theoretical 
assumptions.  We  adopted  the  computer 
metaphor'  and  assumed  thut  human  behavior, 
in  response  to  a  stimulus,  is  determined  by  the 
activities  of  several  intervening  processes.  The 
number,  nature,  and  mode  ot  interaction  ot 
these  intervening  processes  were  not  specified. 
However,  different  theories  in  cognitive 
psychology  provide  a  variety  of  alternative, 
competing  specifications  which  can  be  used 
both  to  guide,  and  be  tested  by.  research  in 
cognitive  psychophysiology.  We  will  now  re¬ 
view  these  ’guidance'  functions  of  cognitive 
theory. 

II.  The  role  or  theory 

In  claiming  that  psychophysiological  measures 
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BEHAVIORAL  AND  BRAIN  SCIENCES 


FINAL  DRAFT  FOR  COMMENTATORS 
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exceot  for  typographical  errors. 
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PRECOMMENTARY  ON  VERLAGER'S  CRITIQUE  OP 
THE  CONTEXT  UPDATING  MODEL 


Is  the  P300  componenent  a  manifestation  of  context  updating? 


Emanuel  Donchin  and  Michael  G.H.  Coles 
Cognitive  Psychophysiology  Laboratory 
University  of  Illinois  at  Urb&na-Champalgn 
603  East  Daniel  Street 
Champaign  IL  61820 


ABSTRACT:  To  understand  the  endogenous  components  of  the  ERP  we 

must  use  data  about  the  information  processing  function  of  the 
underlying  brain  activity.  These  hypotheses  in  turn  generate 
testable  predictions  about  the  consequences  of  the  component.  We 
review  the  application  of  this  approach  to  the  analysis  of  the 
P300  component,  whose  amplitude  is  controlled  multiplicati vely  by 
the  subjective  probablity  and  the  task  relevance  of  the  eliciting 
events  and  whose  latency  depends  on  the  duration  of  stimulus 
evaluation.  These  and  other  factors  suggest  that  the  P300  is  a 
manifestation  of  activity  occuring  whenever  one's  model  of  the 
environment  must  be  revised.  Tests  of  three  predictions  based  on 
this  "context  updating"  model  are  reviewed.  Ver leger’s  critique 
is  based  on  a  misconstrual  of  the  model  as  well  as  on  a  partial 
and  misleading  reading  of  the  relevant  literature. 


@  1987  Cambridge  University  Press 


L  Introduction 

In  the  Forward  to  his  'New  Essays  on  the  Psychology  of  Art,"  Amheim  (1986)  makes  the  following 
observation, 

"Articles  in  professional  journals  profit  from  a  permanence  of  their  own,  albeit  one  ded  to  the 
a  me  of  publication,  A  discipline  grows  like  a  tree,  one  on  which  the  function  of  every  new 
twig  is  determined  by  its  place  in  the  whole.  Each  contribution  justifies  itself  by  addressing  a 
question  that  the  profession  has  put  on  the  agenda  at  that  time," 

It  would  have  been  useful  if  this  view  had  guided  Verleger  (target  article,  this  issue)  as  he  reviewed 
the  literature  on  the  P300  component  of  the  Event  Related  Brain  Potential  (ERP).  Verleger  trees 
the  body  of  publications  concerned  with  the  P300  as  if  it  had  ail  been  created  simultaneously.  He 
singles  out  selective  pieces  of  writing  originating  from  different  historical  contexts  to  patch  together 
the  theory  he  advocates  while  launching  a  rather  sharp  attack  on  two  versions  of  "context  updating," 
which  he  attributes  to  Donchin  and  his  colleagues.  However,  neither  of  these  versions  is  a  valid  or 
even  a  reasonable  approximation  to  our  ideas.  The  independence  of  his  discussion  from  the  histori¬ 
cal  context  may  be  what  makes  it  so  difficult  for  Verleger  id  see  how  inaccurately  he  represents  our 
views. 

One  example  will  illustrate  the  inadequacy  of  Verleger’s  survey  of  the  literature.  Investigators  of  the 
P300  are  keenly  aware  that  at  least  two  attributes  of  the  component  -  its  latency  and  its  amplitude  • 
must  be  measured  in  any  study.  Furthermore,  it  is  clear  that  the  amplitude  and  the  latency  are  each 
affected  by  somewhat  different  experimental  manipulations  and  that  the  variance  in  amplitude  and 
the  variance  in  latency  present  rather  different  theoretical  challenges.  A  substantial  literature  has 
accumulated  since  Ritter.  Stinson  and  Vaughan  (1972)  published  the  first  systematic  study  of  the 
relationship  between  the  latency  of  the  P300  and  reaction  time  (RT).  Indeed,  our  account  of  the 
functional  significance  of  the  P300,  which  Verleger  labels  the  "context  updating"  model,  evolved  in 
pan  from  an  attempt  to  account  for  the  seemingly  counterintuitive  dissociation  between  P300 
latency  and  RT. 

It  makes  little  sense,  therefore,  to  examine  the  context  updating  hypothesis  devoting  as  little  atten¬ 
tion  as  Verleger  does  to  the  latency  of  P300.  His  concentration  on  the  variance  in  P300  amplitude 
and  his  decision  to  interpret  the  words  "working  memory"  and  "expectancy"  in  the  narrowest 
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po sable  sense  creates  two  straw  men.  These  are  then  subjected  to  vigorous  criticism  in  an  ostensible 
refutation  of  our  model.  It  should  be  pointed  out  that  not  only  is  Verieger  attacking  the  wrong 
models,  his  attempted  critique  is  weakened  by  his  treatment  of  empirical  data.  Important  segments 
of  the  literature  are  ignored  (the  literature  on  P300  latency  is  but  one  example).  When  data  are  dis¬ 
cussed.  Verieger  offers  speculative  interpretations  that  merely  serve  to  fit  his  preconceptions. 

This  precommentary  begins  with  a  review  of  the  explanatory  tasks  an  ERP  investigator  must  under¬ 
take  at  different  phases  of  the  investigation.  We  then  present  our  own  view  of  the  "context  updating" 
model,  reviewing  the  data  on  which  we  base  this  view  as  we  proceed.  Where  appropriate,  we  also 
comment  on  Verleger’s  interpretations  of  the  data.  We  have  devoted  a  portion  of  this  precommen¬ 
tary  to  a  me ta theoretical  discussion  of  our  approach  to  the  study  of  ERP  components. 

2.  The  Endogenous  Components  of  the  ERP 

2.1.  Some  general  remarks  on  ERPs 

The  application  of  signal  averaging  methodology  to  the  study  of  brain  activity  which  is  time  locked 
to  specific  events  made  possible  the  study  of  event  related  brain  potentials  (ERPs)  in  awake  behav¬ 
ing  human  subjects.  ERPs  are  records  of  the  activity  of  neuronal  ensembles  activated  in  a  fixed  tem¬ 
poral  relation  to  an  event  whose  time  of  occurrence  can  be  made  known  to  a  computer  and  whose 
geometry  allows  their  individual  fields  to  summate  and  generate  a  field  large  enough  to  be  record¬ 
able  from  the  scalp.  (For  a  discussion  of  the  biophysical  and  neurophysiological  basis  of  research 
on  the  ERP  see  Allison  1984;  Allison,  Wood  and  McCarthy  1986;  Nunez  1981.) 

The  ERP  leaves  much  to  be  desired  as  a  window  on  the  brain.  The  neurophysiological  significance 
of  the  fields  is  a  matter  of  controversy.  The  specific  intracranial  sources  that  generate  the  potentials 
are.  in  the  main,  unknown.  Furthermore,  it  is  quite  possible  that  the  neuronal  ensembles  whose 
activity  is  recorded  in  this  fashion  do  not  constitute  distinct  and  unique  neuroanatomical  en rides 
(Allison  et  al  1986,  are  particularly  eloquent  on  the  deficiencies  of  ERPs  as  measures  of  brain 
activity).  Nonetheless,  ERPs  have  been  studied  with  vigor  for  the  past  two  decades  and  have 
engaged  the  interest  of  many  investigators.  This  interest  is  driven  by  the  one  advantage  the  ERP  has 
over  other  manifestations  of  brain  activity,  namely,  that  it  can  be  measured  in  the  awake,  behaving 
human.  The  recordings  can  be  done  without  recourse  to  invasive  procedures  and  at  a  relatively  low 
cost.  The  work  has  also  received  considerable  momentum  from  the  fan  that  such  recordings  are 
highly  sensitive  to  a  host  of  experimental  manipulations.  Whether  one  varies  the  physical  parame¬ 
ters  of  the  stimulation,  the  class  of  subjects  used  or  the  cognitive  operations  the  subjects  apply  to  the 
eliciting  events,  the  ERPs  show  patterns  that  are  stable  in  and  between  individuals  and  occasions. 
This  stability  of  pattern  in  the  ERP  has  proved  useful  in  the  development  of  various  diagnostic  pro¬ 
cedures  in  neurology,  audiology  and  psychiatry,  (e.  g.  Crecco  and  Bodis-Wollner  1986).  However, 
in  this  precommentary  we  will  focus  on  the  possibilities  and  problems  that  arise  when  one  attempts 
to  use  ERPs  as  tools  in  the  study  of  cognitive  function. 

In  the  middle  60s  it  became  apparent  that  the  ERPs  contain  "endogenous"  components  whose  vari¬ 
ance  appears  to  be  relatively  insensitive  to  the  physical  attributes  of  stimuli.  In  fact,  the  eliciting 
event  need  not  even  be  a  physical  stimulus  (Sutton,  Tuering,  Zubir  and  John  1967).  Endogenous 
components  are  sensitive  to  variation  in  the  information  processing  activities  imposed  on  the  subject 
by  the  assigned  tasks  (see  Donchin,  Ritter  and  Me  Call  urn  1978,  for  a  discussion  of  the  distinction 
between  exogenous  and  endogenous  components). 

2JL  Classes  of  Psycbophysiological  assertions 

Both  Verleger’s  target  article  and  this  precommentary  are  concerned  with  an  endogenous  com¬ 
ponent  which  was  first  described  by  Sutton,  Brarea,  Zubin  and  John  (1965)  who  reported  that 
stimuli  which  "resolve  the  subject's  uncertainty"  elicit  an  ERP  characterized  by  a  large  positive 
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component,  with  a  Latency  to  the  peak  of  about  300  msec,  which  will  be  referred  to  here  as  the 
P300.1 2  The  enterprise  Launched  by  Sutton  has  yielded  considerable  information  about  the  P30O^. 
An  examination  of  the  literature  reveals  that  investigators  concerned  with  the  P300  make  assertions 
which  can  be  classified  into  ooe  of  the  following  categories. 

12.1.  Statements  about  antecedent  conditions 

These  are  statements  about  the  functional  re  la  do  ns  hips  between  various  attributes  of  the  P300,  such 
as  its  amplitude  and  latency,  and  assorted  independent  variables.  For  example,  the  claim  that  the 
amplitude  of  P300  is  inversely  proportional  to  the  subjective  probability  of  the  eliciting  events, 
given  constraints  on  the  cask  relevance  of  the  events,  is  a  statement  about  the  conditions  antecedent 
to  the  elicitation  of  the  P300.  The  antecedent  conditions  of  the  P300  have  been  studied  in  detail  and 
there  is  enough  consensual  data  to  support  a  substantial  number  of  statements  about  its  antecedent 
conditions.  (For  detailed  reviews  of  the  relevant  Literature  see  Pritchard,  1981;  Johnson,  in  press). 
Such  statements  are  necessary  precursors  of  a  theory  of  the  P300.  However,  a  collection  of  such 
statements  does  not  constitute  an  adequate  theory. 

222.  Statements  about  Applications 

Even  though  statements  about  antecedent  conditions  ate  not  a  theory  of  the  P300,  they  can  serve  as 
the  basis  for  an  application  of  the  P300  in  some  domain  of  human  action  where  a  measurable  index 
of  some  construct  or  variable  is  needed.  Thus,  for  example,  the  observation  that  P300  latency 
depends  on  processing  time  has  led  to  the  suggestion  that  P300  latency  may  discriminate  the 
demented  from  the  depressed  (Goodin,  Squires  and  Stair  1978).  The  validity  of  this  diagnostic  use 
of  the  P300  depends  solely  on  empirically  demonstrable  diagnostic  power  and  does  not  require  a 
theoretical  understanding  of  the  component,  its  origin  or  its  functional  significance.  Similarly,  the 
fact  chat  P300  amplitude  reflects  the  subject's  focus  on  one  of  several  concurrently  assigned  tasks 
makes  it  possible  to  use  it  as  a  measure  of  "mental  workload"  (Donchin,  Kramer  and  Wiaceas 
1986a).  From  an  applied  perspective  such  assertions  are  quite  useful.  However,  they  treat  the  ERP 
component  as  an  index  and  do  not  necessarily  shed  light  on  its  functional  significance. 

22  J.  Generalizations  about  antecedent  conditions 

Several  investigators  have  attempted  to  integrate  the  many  empirical  statements  about  the  diverse 
independent  variables  which  affect  the  P300  into  a  coherent  statement  that  will  identify  the  com¬ 
monality  among  these  statements.  It  is  generally  ««■««***<  that  the  diversity  of  effects  is  deceptive 
and  that  there  is  a  common  denominator  to  the  various  conditions  »nAfr  which  the  P300  is  elicited. 
It  would  clearly  be  useful  to  have  such  a  generalization  so  that  one  could  predict  the  specific  effec 
of  any  given  manipulation  on  P’300.  This  approach  is  illustrated  by  Johnson’s  (1986)  proposal  thai 
the  amplitude  of  P300  is  controlled  by  an  additive  and  multiplicative  relationship  between  several 
variables.  The  interpretation  advocated  by  Verleger,  namely,  that  "P3s  are  evoked  by  stimuli  which 
are  awaited  when  subjects  are  dealing  with  repetitive,  highly  structured  tasks"  constitutes  another 
such  generalization  about  the  antecedent  conditions  of  P300.  Even  though  such  generalizations  are 

1  Components  of  the  ERP  are  labeled  by  a  law  inriiraring  polarity  and  a  number  indicating  their  nodal  latency 
Howe««r.  other  nomcnclannes  exist  and  this  very  same  component  may  be  Labeled  P3  by  some  auhon,  including  Vcr 
leper,  and  P3b  by  others. 

2  In  keeping  with  the  usage  in  Verleger ’i  target  article  we  will  ignore  the  controversy  regsrding  ’multiple  pontivi 
tiee.*  It  is  quite  likely  that  a  variety  of  processes  "wiifw  themselves  on  the  scalp  concurrently  with  those  manifesto 
by  P300.  although  the  vdor  with  which  this  proposition  is  embraced  varies  across  laboratories.  However,  a  P300-tik 
component  appears  in  everyone's  pantheon  of  components  and  it  is  in  this  component  that  we  direct  our  anrnhre*  in  thi 
precommenmy. 
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valuable  (and  necessary  precursors  to  a  theory),  they  leave  the  functional  significance  of  the  P3Q0 
unexplained. 

2.2.4.  Hypotheses  about  functional  significance 

When  we  refer  to  the  "functional  significance"  of  the  P300  we  imply  that  the  potentials  recorded 
from  the  scalp,  to  which  we  refer  as  the  P300,  are  manifestations  of  the  intracranial  activity  of  a 
specific  functional  entity.  As  we  have  noted  elsewhere  (Donchin,  Karis,  Bashore,  Coles  and  Cratton 
1986b),  it  is  not  necessary  that  the  brain  activity  recorded  as  the  P300  originate  in  a  single  intracra¬ 
nial  "source."  Nor  is  it  necessary  that  the  neuronal  ensembles  whose  activity  contributes  to  the  P30Q 
be  associated  with  a  single  neuroanatomical  entity.  Many  brain  structures  may  be  involved  and 
several  "sources"  (in  the  sense  of  specific  dipoles)  may  contribute  to  the  component.  All  that  is 
necessary  is  that  this  variety  of  neuronal  activities  that  participate  in  a  specifiable  information  pro¬ 
cessing  task  have  the  geometry  and  temporal  characteristics  so  that  they  contribute  to  the  P300.  The 
ultimate  goal  of  a  theory  of  P300  is,  we  suggest,  the  elucidation  of  the  nature  of  this  task. 

The  brain  is  evidendy  an  organ  whose  large  variety  of  activities  can  be  described  in  neurophysiolog¬ 
ical  terms.  There  is  no  question  that  the  ultimate  understanding  of  the  brain  must  include  the 
detailed  explication  of  its  functioning  as  a  mechanism  constructed  of  neurons.  However,  this  neu¬ 
ronal  machine  is  an  infortnadon  processing  device  and  the  myriad  of  biochemical  interactions  that  it 
performs  must  accomplish  tasks  that  can  be  described  in  information  processing  terms.  The  level  of 
resolution  one  adopts  to  describe  the  brain  is  dictated  by  one’s  purpose.  No  level  is  inherendy  more 
interesting  or  more  valid  than  other  levels;  each  descriptive  tier  is  optimal  for  some  purposes  and 
quite  the  opposite  for  others.  We  choose,  in  the  present  context  to  focus  on  descriptions  described 
stricdy  in  information  processing  terms. 


2.2.4.I.  The  Whirring  Noise  analogy 


Within  this  framework  we  view  the  P300  as  a  surface  manifestation  of  an  internal  information  pro¬ 
cessing  operation.  As  theorists  our  goal  is  to  infer  the  nature  of  that  underlying  processing.  Hence 
statements  about  the  antecedent  conditions  of  the  component  do  not  serve  our  purpose;  they  merely 
tell  us  what  needs  to  happen  outside  the  system  to  call  forth  the  activation  of  the  component.  That, 
however,  fails  to  explain  what  the  brain  is  actually  doing,  functionally  speaking,  when  the  com¬ 
ponent  is  active. 


An  example  may  help.  Assume  you  have  a  computer  that  always  makes  a  whirring  noise  when 
activated.  One  may  find  out  that  to  observe  the  whining  noise  a  computer  must  be  attached  to  a 
power  source,  that  it  must  be  equipped  with  a  diskette  drive  and  that  a  diskette  must  be  loaded  into 
the  drive.  We  also  note  that  the  whining  noise  is  generated  whenever  the  computer  has  been  reset 
following  a  sufficiendy  serious  error.  These  and  other  such  statements  are  collectively  a  description 
of  the  antecedent  conditions  for  that  whirring  noise.  However,  none  of  these  individually,  or  all 
together,  account  for  the  whining  noise  in  a  satisfactory  manner  because  none  specifies  the  purpose 
of  the  process  from  which  the  noise  emanates.3  Through  an  accumulation  of  such  antecedent  condi¬ 
tions  we  may  conclude  that  the  whirring  noise  is  heard  whenever  there  appears  to  be  a  need  for  addi¬ 
tional  data  not  immediately  available  in  the  computer’s  memory.  This  will  remain  a  statement  about 
the  antecedent  conditions  because  it  still  does  not  suggest  what  is  the  function  of  the  process  mani¬ 
fested  by  the  noise.  One  may  however,  after  appropriate  research,  infer  that  the  whirring  noise  is 
generated  whenever  data  must  be  transmitted  between  the  computer's  memory  and  an  external 


1  Note  Use  distinction  between  the  function  of  the  noise  itself  of  which  there  is  nooe  and  the  function  of  the  process 
generating  the  noise  as  a  byproduct  This  is  an  apt  analogy  to  the  ERP  as  we  are  not  claiming  any  functional  significance 
to  the  potentials  we  record  on  the  scalp.  Rather,  we  assume  that  what  we  record  are  byproducts  of  the  activatioe  of 
processes  which  do  have  functional  significance. 
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storage  device.  The  noise  is  generated  by  a  process  invoked  in  the  course  of  this  data  transmission, 
namely,  when  the  mechanical  device  used  to  whirl  the  diskette  around,  or  to  move  the  reading  head 
about,  is  active.  These  are  statements  about  the  functional  significance  of  the  process  manifested  by 
the  noise.  The  process  is  very  specific.  The  activation  of  the  diskette  motors  is  performed  in  the 
service  of  data  transmission  between  the  CPU  and  the  diskette.  It  has  an  actual  effect  on  the  system 
only  if  other  component  processes  are  activated  but  its  own  functional  significance  can  be  described, 
to  an  extent,  without  reference  to  these  other  processes.  Furthermore,  it  will  turn  out  that  whereas 
we  can  bring  about  this  whining  noise  whenever  we  boot  the  computer,  there  are  many  other  occa¬ 
sions  that  have  little  functional  similarity  to  the  specifics  of  starring  the  computer’s  day  which  also 
cause  the  whining  noise  to  appear  because  all  these  diverse  processes  share  a  need  to  access  the 
diskette,  they  all  activate  the  motor  and  all  activations  generate  the  same  whining  noise. 

This  analogy  illustrates  a  distinction  that  appears  not  to  have  been  made  clearly  enough  in  our  previ¬ 
ous  writings.  Note  that  the  noise  is  a  manifestation  of  an  information  processing  operation  involving 
the  transmission  of  the  data  from  a  device  called  a  diskette  into  memory.  However,  the  generator  of 
the  noise  -  namely,  the  actions  of  the  motor  •  is  not  itself  necessary  for  the  transmission  of  the  infor¬ 
mation.  Thus,  for  example,  it  is  possible  to  simulate  a  diskette  in  memory.  No  whirring  noise  will 
be  generated  when  the  computer  exchanges  data  with  these  memory-based  simulated  diskettes.  In 
other  words,  under  a  specific  set  of  circumstances  one  can  obtain  an  external  manifestation  of  an 
information  processing  operation  because  the  side  effects  of  some  ancillary  process  are  observable. 
This  side  effect,  or  the  ancillary  process  which  it  manifests,  is  not  identical  with  the  totality  of  the 
information  processing  operation,  it  merely  makes  it  observable.  This  distinction  is  important 
because  Verieger's  interpretation  of  our  model  is  that  the  P300  is  a  manifestation  of  the  updating 
process  per  se  whereas  we  have  tried  carefully  to  assert  that  it  is  a  manifestation  of  a  process 
invoked  in  the  service  of  the  updating  process,  not  necessarily  the  updating  per  se. 

We  view  the  P300,  and  all  ERP  components,  much  the  same  way  we  view  the  whirring  noise  in  the 
foregoing  example.  We  assume  that  the  component  is  generated  as  a  byproduct  of  the  activation  of  a 
neural  process  whose  function  is  the  transformation  of  information.  A  satisfactory  explanation  of  the 
component  will,  accordingly,  describe  the  specific  processes  executed  by  the  activity  of  which  the 
component  is  a  manifestation  rather  than  by  describing  what  it  takes  to  generate  the  component,  or 
to  control  its  attributes.  Note  that  neither  of  Verieger’s  construals  of  our  view,  his  "Update''  and  his 
"Expect,  '  satisfies  this  condition.  Both  purport  to  specify  the  rather  limited  conditions  under  which  a 
P300  can  be  elicited  but  neither  specifies  its  functional  significance.  "Context  updating"  as  we  inter¬ 
pret  it  attempts  to  identify  a  specific  type  of  cognitive  process  which  underlies  the  P300. 

2J.  The  Concept  of  'Psychological  Processes?' 

2-3.1.  Introductory  remarks. 

Verleger  begins  his  target  article  by  promising  to  "clarify  what  psychological  process  is  associated 
with  the  P300  component."  His  clarification  consists  of  the  suggestion  that: 

”P3s  are  evoked  by  stimuli  which  are  awaited  when  subjects  are  dealing  with  repetitive,  highly 
structured,  tasks.  Effects  on  P3  can  be  modeled  by  four  parameters:  (1)  Whether  or  not  stimuli 
match  the  expectancy  for  closing  events.  (2)  The  number  of  open  epochs.  (3)  The  degree  of 
capacity  invested  in  the  epoch  (positive  correlation)  (4)  The  degree  of  processing  required  by 
the  stimulus,  (negative  correlation)  (p.  2)." 

This  hardly  seems  to  be  the  description  of  a  psychological  process  as  one  ordinarily  understands  the 
term.  Verleger  is  not  alone  in  using  "psychological  process"  so  loosely.  There  have  been  many 
claims  in  the  literature  associating  P300,  and  other  ERP  components,  with  various  psychological 
concepts.  The  efforts  range  from  identifying  a  description  of  antecedent  conditions  with  psychologi¬ 
cal  concepts  to  rather  grandiose  attempts  to  associate  P300  with  such  global  concepts  as  "attention" 
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or  "memory.''  It  is  important  to  be  precise  in  describing  such  associations,  in  particular  with  regard 
to  the  level  of  the  concepts  used. 

2J.2.  The  hierarchy  of  concepts 

There  is  a  hierarchy  of  descriptors  of  psychological  processes  and  we  claim  that  ERPs  arc  manifes¬ 
tations  of  acdvides  related  to  an  elementary  level  in  this  hierarchy.  Descriptors  of  the  highest  level 
are  terms  such  as  modvadon,  emotion,  memory  and  learning.  These  terms,  indubitably  psychologi¬ 
cal.  describe  broad  categories  of  interactions  between  the  organism  and  the  environment.  The 
instantiation  of  any  such  function  may  call  forth  a  diverse  set  of  informadon  processing  activities. 
These  functions  may  accordingly  require  the  activation  of  such  "psychological''  processes  as 
"choice,''  "recall,''  "decision,"  or  "discrimination."  Such  descriptors  refer  to  specific  information- 
processing  functions  each  of  which  can  be  marwilled  in  the  service  of  one  or  another  of  the  func¬ 
tions  described  at  the  higher  levels.  Functions  at  the  second  level  of  the  hierarchy  require,  in  cum, 
the  use  of  more  elementary  functions  such  as  "encoding"  or  "mismatch  detection."  Descriptive 
terms  at  all  levels  of  the  hierarchy  refer  to  "psychological"  processes. 

A  computer  analogy  may  clarify  the  distinction4  Assume  that  one  can  examine  all  the  programs  in  a 
computer  system  over  a  period.  It  is  clear  that  these  programs  can  be  classified  into  different 
categories  such  as:  accounting  packages,  games,  word  processors,  statistical  programs,  and  so  on. 
This  is  a  level  of  description  that  focuses  on  the  function  of  the  programs,  the  goals  of  the  system  as 
it  interacts  with  the  user  or  the  world.  At  this  level  of  description  all  word  processing  programs  can 
be  considered  equivalent  even  though  they  may  be  quite  different  in  their  details. 

Computer  programs  can  also  be  discussed  in  a  way  that  cuts  across  these  goal  oriented  descriptions. 

A  lower  level  describes  the  functions  that  must  be  executed  in  the  system  -  with  no  necessary  com¬ 
mitment  to  implementation.  As  illustrations  consider  acquire  data,  store  data,  detect  errors,  back  up 
data,  invert  matrix  or  produce  graphic  displays.  The  important  point  is  that  this  class  and  the  one  just 
above  it  are  not  congruent.  That  is.  each  of  the  higher  levels  uses  subsystems  of  the  lower  kind. 
Thus  data  may  be  acquired  to  subserve  "accounting"  or  "data  management."  The  descriptions  that 
focus  on  the  system's  goals  are  orthogonal  to  the  descriptions  that  focus  on  the  actions  the  system 
must  perform  to  accomplish  those  goals.  This  is  even  truer  for  a  still  lower  descriptive  level  refer¬ 
ring  to  operations  at  the  implementaaonal  level  (e.g.  move  index  register  I  to  the  accumulator,  or 
release  all  I/O  channels).  [See  also  Anderson:  "Methodologies  for  Studying  Human  Knowledge" 
BBS  10(3)  1987.) 

The  analogy  extends  to  descriptors  of  psychological  processes  in  that  there  are  descriptors  that  refer 
to  the  general  class  of  goals  used  by  the  system,  "attention"  "memory"  "love"  and  so  forth  are  in  this 
sense  analogous  to  "accounting  programs."  At  a  lower  level  we  have  "psychological  processes" 
such  as  "encoding"  and  at  an  even  lower  level  we  may  have  elementary  information  processing 
operations.  It  is  not  clear  to  which  category  of  processes  Verieger  refers  when  he  speaks  of  "psycho¬ 
logical  processes."  It  would  seem,  from  his  use  of  the  term  "expectancy,”  that  he  is  referring  to 
"psychological  processes"  at  the  highest  level  of  the  hierarchy.  Yet  the  data  on  the  endogenous  com¬ 
ponents  of  the  ERP  suggest  that  they  are  "related"  to  processes  that  are  describable  at  an  elementary 
level  of  the  hierarchy  just  discussed.  Thus,  we  are  not  trying  to  "correlate"  a  brain  wave  with  a  high 
level  psychological  process,  as  implied  for  example  by  Churchland  (1986),  but  rather  to  ask  whether 
the  brain  wave,  viewed  as  the  scalp  manifestation  of  information  processing,  does  indeed  behave  in 
the  way  the  byproduct  of  such  a  processor  would  behave,  given  certain  manipulations. 


*  For  those  who  are  skeptical  about  analogies  to  computers  we  note  that  we  are  using  this  comparison  merely  to  illus¬ 
trate  the  use  of  different  descriptive  levels  when  discussing  an  in/ormabon  processing  system. 
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It  may  be  argued  that  the  study  of  byproducts  is  of  no  inherent  interest  This  is  a  shortsighted  view. 
We  suggest  that  as  long  as  there  is  a  valid  relation  between  the  process  of  interest  and  its  measurable 
byproducts,  what  matters  is  the  degree  to  which  the  measurements  can  be  placed  at  the  service  of 
interesting  theoretical  questions.  Consider,  for  example,  the  eiectrorednogram  (ERG).  Although 
clearly  the  byproduct  of  an  ancillary  process,  the  ERG  is  evidently  a  manifestation  of  the  activation 
of  the  re  ana.  The  function  it  manifests  is  the  processing  of  the  light  patterns  that  fall  on  the  retina.  It 
is  thus  a  critical  component  of  the  visual  system.  One  may  End.  upon  study,  that  ERGs  art  elicited 
when  one  directs  one's  gaze  with  love  at  one  target  and  with  hate  at  another.  This  does  not  mean 
that  the  ERG  is  "psychologically  meaningless.’'  It  means  rather  that  the  psychological  process  mani¬ 
fested  by  the  ERG  is  more  elementary  than  "love"  and  "hate"  in  the  sense  that  the  function  in  which 
it  participates,  namely,  "vision,"  may  be  involved  in  both  of  these  higher  level  psychological 
processes. 

We  have  examined  in  detail  the  nature  of  the  enterprise  in  which  we  are  engaged  as  we  theorize 
about  the  P300  because  Verleger  is  attributing  to  us  an  agenda  we  eschew.  This  error  distorts  his 
reading  of  the  Literature  and  focuses  his  attention  on  trivialities.  He  seems  to  be  persuaded  that  the 
task  we  have  undertaken  is  to  identify  a  high  level  concept  such  as  ‘‘surprise"  or  expectancy"  or 
"memory"  with  the  components.  He  then  seeks  out  evidence  that  would  be  relevant  to  the  demons¬ 
trations  he  feels  we  ought  to  be  conducting.  He  then  attacks  the  validity  of  the  evidence,  thereby 
missing  the  most  important  corollary  of  the  emphasis  on  the  search  for  the  functional  significance  of 
the  component,  namely,  the  emphasis  on  the  consequences  of  the  component  rather  than  on  its 
antecedents. 

2.4.  The  search  for  consequences 

The  concern  with  the  consequences  of  ERP  components  is  a  crucial  distinction  between  our 
approach  and  Verleger’s.  The  issue  concerns  the  process  by  which  one  validates  theoretical  asser¬ 
tions  about  an  ERP  component.  It  would  appear  that  for  Verleger  the  prime  criterion  for  evaluating  a 
theory  is  the  post-hoc  plausibility  of  the  generalizations  one  makes  about  the  antecedent  conditions. 
This  is  evident  from  his  treatment  of  (his  construai  of)  the  context  updating  model  and  from  the  way 
he  supports  his  own  model.  Consider,  for  example,  his  treatment  of  the  "Update"  model,  which  he 
summarizes  as  follows: 

"Each  stimulus  has  its  representation  (’trace’)  in  working  memory,  which  decays  over  time. 
After  stimulus  evaluation,  the  trace  of  the  presented  stimulus  is  updated  to  its  optimal  upper- 
limit  value  and  it  is  this  updating  that  is  refiectcd  by  P3.  If  the  most  recent  presentation  of  the 
present  stimulus  was  not  too  long  ago,  then  the  amount  of  updating  required  is  small,  and  so  is 
P3.  But,  if  that  presentation  was  not  too  recent,  then  the  amount  of  updating,  and  likewise  the 
P3,  has  to  be  larger."  (p.  4) 

Ignore  for  the  moment  the  fact  that  we  have  never  implied  that  the  P300  is  a  manifestation  of  the 
updating  of  specific  memory  traces.  It  is  interesting  to  examine  how  Verleger  evaluates  the  evi¬ 
dence  he  sees  as  relevant  to  this  version  of  context  updating. 

Verleger  accepts  two  studies  as  "direct  tests"  of  the  hypothesis.  In  one,  Klein.  Coles  and  Donchin 
(19&4)  reported  that  subjects  with  perfect  pitch  do  not  produce  a  P300  in  an  auditory  oddball3  task. 


1  The  tens  •oddball"  applies  to  ail  tasks  which  satisfy  the  following  condi done  (a)  The  subject  is  presented  with  a 
senes  of  events,  where  ’event"  can  be  defined  in  a  complex  and  abstract  manner  as  long  as  the  subject  can  distinguish 
one  event  hum  another,  (b)  A  cUsstficaooo  rule  is  defined  which  categorizes  each  of  (ha  events  inn  one  of  two 
categories,  (c)  The  subject  is  assigned  a  task  whose  performance  depends  on  applying  the  categorization  rule  to  the 
events,  (d)  Events  in  the  two  categories  are  chosen  at  random,  that  is,  they  consanue  a  Bernoulli  sequence.  In  general,  a 
fifth  condition  has  alao  obtained,  namely,  (e)  the  probability  of  one  of  the  categories  is  substantially  \owa  than  that  of 
tbs  other  category.  It  is  this  focus  on  the  rarity  of  one  class  of  events  flat  gave  the  pmdigm  Us  label  ("oddbsll.")  And,  in 
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The  other  is  the  series  of  studies  of  the  von  Restorff  effect  published  by  Karis,  Fabiani  and  Donchin 
(1984),  and  Fabiani,  Karis  and  Donchin  (1986).  It  is  instructive  to  note  the  discrepancy  between  the 
actual  reports  of  these  studies  and  the  way  they  are  described  by  Verleger.  Of  the  Klein  et  al  study 
Verleger  says  that  the  investigators  "presented  two  go/no-go  tasks  to  control  subjects  and  to  music 
students  who  could  identify  the  absolute  pitches  of  tones  (p.  2)”  It  is  not  merely  a  quibble  to  point 
out  that  Verleger’s  implication  that  the  control  subjects  were  not  music  students  is  not  consistent 
with  the  report  by  Klein  et  al  that  "All  subjects  were  students  of  music  at  the  University  of  Illinois 
(p.  1306)"  Neither  is  it  a  quibble  to  note  that  the  subject’s  self  report  of  their  ability  to  make  abso¬ 
lute  judgments  of  pitch  was  not  the  sole  criterion  for  the  results  reported.  In  fact,  all  subjects  were 
tested  for  their  ability  to  name  tones.  The  amplitude  of  the  P30Q  was  negatively  correlated  (r  =»  -.63) 
with  their  actual  performance  on  the  Lockhead  and  Byrd  test  (1981)  regardless  of  their  self  percep¬ 
tion  of  their  drill  as  perceivers  of  absolute  pitch.  These  correlations  are  presented  in  Table  I  of  Klein 
et  al  (page  1306).  Of  course,  had  Verleger  not  ignored  these  crucial  details,  he  might  have  been  less 
inclined  to  propose  that  Klein  et  al’s  results  can  be  explained  by  a  difference  in  motivation  between 
control  subjects  and  AP  subjects  ("who  probably  knew  that  they  were  selected  because  of  their  spe¬ 
cial  skills,  (p.  5))"  It  should  be  noted  that  Ebner,  Schutz  and  Lucking  (1986)  report  a  successful 
replicadon  of  the  Klein  et  aL  study. 

Verleger’s  presentadon  of  Karis  et  al’s  and  Fabiani  et  al's  Endings  is  similarly  inaccurate.  The  focus 
of  the  study  by  Karis  et  al  was  the  reladon  between  the  amplitude  of  P300  and  recall  in  the  context 
of  the  Von  Restorff  paradigm.  The  interesting  aspect  of  the  paradigm  is  that  a  subject  is  not 
presented  merely  with  a  "list  of  words,"  as  Verleger  states  it,  but  rather  with  a  list  of  words  some  of 
whose  members  are  deviant  from  the  rest  of  the  list  The  recall  of  such  "isolates"  is  enhanced,  as 
reported  by  von  Restorff  (1933).  Karis  et  al’s  abstract  describes  the  actual  results  as  follows: 

"there  were  strong  reladonships  between  the  Von  Restorff  effect  over  all  recall  performance, 
mnemonic  strategies,  and  the  associadon  between  components  of  the  ERP  and  recall  perfor¬ 
mance.  The  overall  recall  performance  of  subjects  who  reported  simple  (rote)  strategies  was 
low.  but  they  showed  a  high  Von  Restorff  effect  For  these  subjects  the  amplitude  of  P300  eli¬ 
cited  by  words  during  inidal  presentadon  predicted  later  recall,  la  contrast  subjects  who 
reported  complex  mnemonic  strategies  remembered  a  high  percentage  of  words  and  did  not 
show  a  von  Restorff  effect  For  these  subjects  P300  did  not  predict  later  recall,  although  a 
’slow  wave’  component  of  the  ERP  did  (p.  177)." 

This  partem  of  results  provides  striking  support  for  the  "context  updating"  model  as  we  construe  it 
It  is  conceivable  that  others,  Verleger  included,  will  not  be  as  persuaded  as  we  are  by  these  data,  but 
U  would  seem  incumbent  on  anyone  commenting  on  the  study  to  consider  the  data  as  reported  rather 
than  a  rather  selective  and  inaccurate  simplification. 

We  will  return  to  the  specifics  of  these  studies  later  as  we  examine  the  evidence  for  the  context 
updating  modeL  However,  we  note  here  that  in  his  analysis  of  what  he  calls  "direct  evidence"  for 
"Update,"  Verleger  is  primarily  concerned  with  the  degree  to  which  the  studies  demonstrate  that 
manipulations  of  "working  memory  traces"  affect  the  P300.  This  tendency  is  even  more  apparent  in 
his  discussion  of  the  "indirect  evidence."  Verleger  enumerates  a  series  of  topics  relating  to  changes 
in  P300  amplitudes  in  different  experimental  conditions.  Thus  he  notes,  inter  alia,  that  "Sequence 
effects  are  flexible,"  and  that  "Probability  might  affect  P3  independently  of  sequence"  or  that  "There 
is  a  target  effect."  All  these  asserdrns  are  correct  enough.  As  a  substantial  portion  of  the  data  base 
he  adduces  derives  from  work  conducted  in  our  laboratory  we  are  quite  content  to  accept  the  vali¬ 
dity  of  the  empirical  claims.  But,  unlike  Verleger,  we  find  all  these  data  consistent  with  the  context 

general  the  rarer  the  event,  the  larger  the  P300  it  elicits  (Duncan  Johnson  and  Donchin  1977).  However,  it  has  beta 
known  far  some  time  that  rarity  is  neither  necessary  nor  sufficient  to  elicit  a  P300.  (See  Fabiani.  Granon.  Karis  and  Don¬ 
chin  (in  press),  for  a  review  of  the  paradigms  toed  in  P300  experiments.) 
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updating  model  as  we  construe  it.  We  would  not  adduce  them  as  conclusive  evidence  for  the  model, 
however,  because  the  data  cited  by  Verleger  invariably  involve  statements  about  the  antecedent  con* 
ditions,  and  the  relations  of  these  to  the  theory  is  necessarily  post-hoc. 

The  structure  of  Verleger’ s  argument  appears  to  be  the  following.  A  model  of  the  information  pro¬ 
cessing  system  is  constructed  in  the  form  of  a  block  diagram  (see  Verieger’s  Fig  2,  p.  4).  The  data 
on  P300  are  than  considered  and  an  association  between  the  P300  and  some  element  in  the  (essen¬ 
tially  structural)  model  is  derived.  The  data  considered  in  forming  the  association  between  the 
model  and  tne  P300  are  assertions  about  antecedent  conditions.  In  this  phase  of  theorizing  one  seeks 
a  plausible  account  of  the  known  data  which  would  integrate  them  as  fully  as  possible  in  a  reason¬ 
able  model  of  the  human  information  processing  system.  This  is  fine,  except  that  it  is  entirely  circu¬ 
lar  for  data  that  were  in  forming  the  model  to  serve  as  evidence  for  the  model.  After  all,  if  the 
model  was  created  because  it  provided  a  plausible  account  of  the  antecedent  conditions  then  any 
assessment  of  the  model  against  these  data  tests  nothing  more  than  the  theorist’s  flair  for  plausible 
post  hoc  explanations. 

Theory  in  psychophysiology,  to  the  extent  that  it  ventures  ideas  about  the  functional  significance  of 
a  component,  must  seek  its  validation  in  its  predictive  power  rather  than  in  its  post-hoc  plausibility. 
This  predictive  power  can  be  tested  because  assertions  about  the  functional  significance  of  com¬ 
ponents  ought  to  generate  testable  predictions  about  the  consequences  of  the  components.  Posner 
appears  to  have  been  the  first  to  note  the  need  to  focus  on  "consequences*  in  psychophysiological 
theory.  In  the  proceedings  of  a  conference  held  in  1973,  he  is  quoted  as  remarking  (in  speaking  of 
the  P300)  that  "We  should  be  able  to  describe  the  consequences  of  its  use*  (McCollum  and  Knott 
1976,  p.  224).  The  concept  of  the  ’consequences'  of  the  use  of  an  ERP  component  derives  from 
viewing  the  component  as  a  manifestation  of  an  elementary  processing  function.  As  Done  tun  et  al 
(1986b)  put  ic 

"We  assume  chat  the  consistency  displayed  by  an  ERP  component  in  its  relation  to  experimen¬ 
tal  manipulations  is  due  to  the  fact  that  the  activity  it  manifests  is  generated  by  the  invocation 
of  a  distinct  component  of  the  information-processing  system.  A  description  of  the  component 
-  if  it  is  to  bear  meaning  for  the  cognitive  scientist  -  must  identify  the  transformations  that  the 
algorithmic  component,  or  the  "subroutine"  performs,  (p.  247)” 

The  metaphor  of  the  "subroutine"  is  important  here.  Subroutines  (these  days  the  metaphor  of  choice 
aught  have  been  the  "procedure"  or  "production  ’)  are  information  processing  entities  that  can  per¬ 
form  a  function  in  the  service  of  a  diverse  array  of  higher  level  programs.  The  programs  calling  the 
subroutine  may  bear  no  resemblance  to  each  other  except  that  at  one  time  or  another  they  require  the 
service  performed  by  the  subroutine.  The  interpretation  of  P300  as  a  "general  purpose  processor"  by 
Donchin,  Kubovy.  Kutas,  Johnson,  and  Heming  (1973)  was  an  attempt  to  cope  with  the  bewildering 
diversity  of  circumstances  in  which  the  P300  can  be  elicited.6  By  1973  a  partial  listing  of  the  cir¬ 
cumstances  under  which  P300  can  be  elicited  would  have  included  stimuli  that  resolve  uncertainty 
(Sutton  et  al  1963);  stimuli  that  are  task  relevant  (Donchin  and  Cohen  1967);  subsets  of  stimuli  used 
in  a  signal  detection  paradigm  (Hill yard.  Squires,  Bauer  and  Lindsey  1971),  or  any  first  stimulus  in  a 


4  Donchin  et  at  (1973)  used  the  following  weeds  *  In  view  of  the  diversity  of  the  P300  function  that  om  data  reveal, 
it  t— *"«  that  a  reasonable  approximation  is  to  inume  that  P300  reflects  the  activity  of  a  general-purpose  processor 
which  is  invoked  on  demand  by  a  boat  of  data  processing  requirements,  (p.  322).*  This  fora  elation  appears  »  havs 
confused  a  number  of  authors  who  interpreted  us  to  be  saying  that  P300  is  a  ganeraiiaed  processor  with  no  specific  func¬ 
tion.  The  iruau.  however,  was  quire  the  opposite.  Tbs  phnue  *  general-purpose,"  for  which  we  toed  a  Floating  Point  Pro¬ 
cesses’  as  an  example,  was  seant  to  imply  a  device  with  quite  a  specific  function  that  could,  however,  subserve  a  larger 
variety  of  higher  level  fuacaoos.  As  noted  above,  the  retina  is  a  device  that  is  specialized  far  the  acquisition  sad  initial 
processing  of  in/onnaboo  caned  by  light.  Yet  it  is  a  quite  general  device,  in  the  sense  that  it  can  be  used  to  look  at. 
in^ir  for  or  look  after  whatever  it  is  that  the  perm's  ask  requires  at  tbs  tune. 


-  10- 


scnes,  even  if  it  carries  no  significance  (Ritter,  Vaughan,  and  Costa  1968). 

Much  of  the  theoretical  effort  at  the  dme  was  directed  at  indentifying  an  overarching  concept  that 
would  subsume  this  broad  range  of  conditions.  The  task  has  proven  unwieldy  and  virtually  every 
attempt  to  solve  the  problem  has  left  pan  of  the  observadons  unexplained.  It  was  in  this  context  that 
Donchin  et  al  (1973)  suggested  that  rather  than  calling  for  an  explanatory  concept  at  the  highest 
level  of  the  hierarchy  of  concepts  it  may  be  useful  to  assume  that  the  P300  is  a  manifestation  of  an 
endry  whose  place  in  the  hierarchy  is  much  lower,  this  would  be  consistent  with  its  apparent  mul¬ 
tipurpose  nature. 

Information  processors  are  defined  by  the  transformations  they  perform  on  information.  They  have 
inputs  and  outputs  and  their  functional  description  must  specify  the  transformation  that  has  been 
applied  to  the  input  to  generate  the  output.  This  is  the  description  we  seek  of  the  processes  underly¬ 
ing  ERP  components.  It  is  critical  to  understand  that  the  ERP  itself  is  not  the  output  of  that  process. 
Like  the  whining  noise  in  our  previous  example,  the  ERP  is  a  byproduct  of  the  activity  of  the  pro¬ 
cessor  which  we  can  use  as  an  index.  The  implications  of  the  view  that  the  ERP  is  not  the  principal 
output  of  the  processor  underlying  it  were  stated  by  Donchin  et  ai  (1986)  as  follows: 

'If  the  ERP  is  not  the  critical  output  of  the  subroutine,  then  it  is  clear  that  a  description  of  the 
relationship  between  the  variables  that  control  attributes  of  the  subroutine  can  not  in  itself  con¬ 
stitute  an  adequate  description  of  the  subroutine.  The  transformations  we  seek  to  describe  are 
between  the  proper  inputs  and  the  proper  outputs  of  the  subroutine.  Therefore  the  study  of  the 
antecedent  conditions  can  only  provide  clues  from  which  one  must  derive  hypotheses  regard¬ 
ing  the  functional  significance  of  the  routine.  These  hypotheses  must  be  stared  in  terms  of  the 
effect  the  activation  of  the  subroutine  has  on  the  recipients  of  its  output.  Clearly,  to  be  of  value 
the  output  of  a  processor  must  act  as  input  to  some  other  sage  of  the  system,  (p.  247)* 

The  emphasis  on  the  study  of  the  consequences  of  the  component  is  a  direct  corollary  of  this 
approach.  If  die  ERP  manifests  an  information  processor  and  indexes  its  utilization,  and  if  we  have  a 
notion  of  what  the  transformations  are  which  the  processor  performs,  then  we  ought  to  be  able  to 
predict  what  would  happen  after  the  activation  of  the  processor  as  a  function  of  its  degree  of  activa¬ 
tion.  The  clearer  our  hypothesis  about  the  component's  function  the  clearer  the  predictions  and  the 
sharper  the  rest  of  the  model. 

3.  The  Antecedent  conditions  for  the  P300 

3.L  Interim  summary 

To  summarize  the  ideas  presented  thus  far  we  note  than 

(1)  We  assume  that  the  P300  is  a  manifestation  on  the  scalp  of  brain  activity  that  performs  a 
specific  information  processing  function.  The  activity  may  occur  in  a  number  of  different  struc¬ 
tures  concurrently,  in  a  complex  pattern. 

(2)  We  assume  that  the  brain  activity  in  question  implements  some  transformation  on  information 
that  can  be  described  in  terms  of  a  model  of  human  information  processing  (a  ’cognitive* 
model).  The  P30Q  may  well  be  the  byproduct  of  a  process  ancillary  to  the  underlying  function. 
Thus,  there  may  be  two  distinct  explanatory  burdens,  only  one  concerned  with  the  principal 
function  manifested  by  the  processes  of  which  the  P300  is  a  component. 

(3)  Theorizing  about  an  ERP  component  consists  of  specifying  candidate  functional  roles 
(specified  as  transformations  of  information).  These  theories  must  be  able  to  predict  the 
’consequences"  of  an  ERP  component.  The  theories  are  tested  by  the  degree  to  which  these 
predictions  are  confirmed. 

(4)  There  is,  of  course,  the  need  to  ensure  that  measures  of  the  ERP  components  are  reliable  and 
valid.  This  is  a  difficult  but  not  an  impossible  task  and  in  this  precommentary  we  will  assume 
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Lhai  the  methodological  problems  are  surmountable.  For  detailed  discussions  of  the  methodol¬ 
ogy  see  Fabiani  et  al  (in  press).  Coles,  Gractoo,  Kramer  and  Miller  (1986),  Glaser  and  Ruchldn 
(1976). 

(5)  Hypotheses  about  the  functional  role  of  the  activity  manifested  by  a  component  must  be 
derived  from  studies  of  its  antecedent  conditions.  We  will  therefore  begin  our  presentation  of 
the  context  updanng  model  with  a  review  of  what  is  lenown  about  the  antecedent  conditions  of 
the  P300. 

32.  Event  Probability  and  the  P300 

3-2.1-  On  the  difference  between  observation  and  interpretation 

Two  contradictory  assertions  can  be  made.  Fust,  we  note  that  it  is  neither  necessary  nor  sufficient 
for  an  event  to  be  the  rare  member  of  a  cask-relevant  Bernoulli  sequence  for  it  to  elicit  a  P300.  We 
immediately  note  that  this  caveat  notwithstanding,  the  evidence  is  overwhelming  that  in  the  main 
the  rarer  the  event  the  more  likely  it  is  to  elicit  a  P300.  Furthermore,  the  amplitude  of  the  P300  eli¬ 
cited  by  such  rare  events  will  be  inversely  related  to  the  probability  of  the  event.  Both  statements  are 
supported  by  a  considerable  body  of  data  and  both  seem  to  be  accepted  by  Verieger.  A  number  of 
authors  have  thoroughly  documented  the  reladonship  (for  a  recent  review  see  Johnson,  in  press).  It 
would  be  useful,  however,  to  comment  on  the  way  these  data  have  accumulated  with  particular 
artendon  to  those  aspects  which  relate  to  Verleger’s  discussion  of  the  role  of  "expectancy." 

One  of  the  difficulties  in  assessing  the  effect  of  event  probability  on  the  P300  is  that  investigators 
have  not  always  distinguished  between  statements  about  the  manipulations  which  are  directly  under 
the  investigators’  control,  and  statements  that  interpret  the  manipulations  in  terms  of  psychological 
constructs.  The  story  is  made  even  more  complicated  because,  as  Sutton  (1969)  has  noted,  it  is  how 
subjects  exercise  their  options  that  determines  how  fully  the  experimenter  controls  the  independent 
variables.  The  problem  appeared  with  the  very  first  paper  in  which  the  P300  was  reported  by  Sutton 
et  al  (1965).  The  results  of  that  classic  experiment  are  summarized  by  the  assertion  that  P300  is  eli¬ 
cited  by  stimuli  which  "resolve  che  subject’s  uncertainty."  However,  it  is  only  inferred  that  the 
subject’s  "uncertainty '  was  manipulated.  What  was  directly  under  Sutton’s  control  was  the  extent  to 
which  subjects  were  informed  in  advance  which  stimulus  would  be  presented.  Subjects  were  also 
instructed  to  predict  the  next  stimulus.  It  is  entirely  accurate,  therefore,  to  report  that  "when  sub¬ 
jects  are  not  informed  in  advance  which  of  the  two  possible  stimuli  will  occur,  and  they  have 
predicted  one  or  the  other,  the  ultimate  appearance  of  the  event  will  elicit  a  P300." 

It  is  plausible,  but  not  self  evident,  that  subjects  were  uncertain  as  to  which  stimulus  would  occur 
and  hence  that  the  occurrence  of  the  stimulus  did  resolve  the  uncertainty.  However,  a  number  of 
assumpnons  that  go  beyond  a  report  of  the  investigator’s  actions  are  made  when  the  concept  of 
"uncertainty"  is  invoked.  It  is  assumed,  for  example,  that  the  subjects  do  indeed  monitor  the  series, 
maintain  a  memory  of  their  predictions,  and  attend  to  the  stimuli  so  that  their  uncertainty  can  be 
resolved.  Whether  or  not  such  assumptions  are  valid  depends  on  the  effectiveness  with  which  the 
investigator  has  controlled  the  experimental  environment.  Furthermore,  it  is  incumbent  on  the  inves¬ 
tigator  to  provide  evidence  that  such  control  has  been  exercised.  (We  do  not  question  Sutton  et  al's 
interpretation  as  they  did  exercise  the  necessary  controls.) 

To  the  extent  one  reviews  the  relationship  between  P300  amplitude  and  specific  experimental  mani¬ 
pulations,  the  pattern  of  results  is  consistent  and  replicable  across  many  laboratories.  On  the  other 
hand,  when  results  are  recast  in  terms  of  explanatory  psychological  constructs  such  as  "uncertainty," 
"surprise"  or  "expectancy,"  controversy  arises,  and  it  does  so  largely  because  the  constructs  used  are 
not  consensually  defined.  Verieger,  for  sample,  while  accepting  the  validity  of  most  relevant  data, 
proceeds  to  argue  that  it  is  not  necessary  that  stimuli  be  "unexpected”  for  them  to  elicit  a  P300.  We 
will  examine  the  specifics  of  this  argument.  But,  it  would  be  useful  to  begin  with  an  examination  of 
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how  the  term  "expectancy "  was  used  by  the  Donchin  and  his  colleagues. 

3.2.2.  Prior  Probability  and  the  P300 

In  a  wide  array  of  circumstances  (though  by  no  means  in  all),  manipulating  the  probability  of  an 
attended  event  (a  "task  relevant"  event)  will  affect  the  amplitude  of  the  P300  elicited  by  the  event. 
True,  in  their  original  experiments  Sutton's  group  made  the  two  events  in  the  series  equiprobable 
and  varied  the  information  the  subject  had  about  the  next  event.  However,  Sutton  used  the  phrase 
"resolution  of  uncertainty"  in  the  formal  information-theoretic  sense.  There,  resolution  of  uncer¬ 
tainty  is  synonymous  with  "delivery  of  information,"  and  Information  is  considered  a  function  of  the 
probabilities  of  the  different  "messages."  Thus,  it  was  natural  to  examine  the  effect  that  variation  of 
the  relative  probabilities  of  the  events  would  have  on  P300  amplitude.  In  subsequent  work  (e.g  Tuet- 
ing,  Sutton  and  Zubin  1970;  Friedman.  Hakerem,  Sutton  and  Fleiss  1973),  the  probability  with 
which  events  occurred  was  varied  and  the  amplitude  of  the  P300  was  again  shown  to  be  inversely 
related  to  the  probability  of  the  eliciting  event.  A  parametric  study  of  the  odd-ball  paradigm  by 
Duncan- Johnson  and  Donchin  (1977),  using  a  larger  number  of  probabilities,  demonstrated  a  mono¬ 
tonic.  inverse,  relationship  between  P300  amplitude  and  stimulus  probability.  (See  Donchin  1979, 
and  Sutton  1979,  for  reports  of  the  status  of  the  issue  as  it  stood  in  1977;  Campbell,  Couxchesne, 
Picton  and  Squires.  1979,  provide  a  review  from  a  somewhat  different  vantage  point  which  arrives 
at  the  same  conclusions.) 

3JJ.  The  role  of  subjective  probability 

It  is  important  to  note,  however,  that  in  many  of  the  studies  considered  relevant  to  the  "probability" 
or  "expectancy"  issue  what  is  being  controlled  is  not  the  prior  probability,  defined  by  the  sequence 
generating  rules  used  by  the  experimenter,  but  rather  the  degree  to  which  the  event  is  likely  from  the 
subject’s  point  of  view.  Horst,  Johnson  and  Donchin  (1980),  for  example,  recorded  the  P300  while 
subjects  were  engaged  in  a  "paired  associates"  learning  task.  In  each  series  of  trials  the  subject  was 
expected  to  learn  which  nonsense  syllable  (traditionally  labeled  a  CVC  for  Consonant-Vowel-  Con¬ 
sonant)  must  be  produced  in  response  to  a  CVC  presented  by  the  computer.  On  each  trial  the  sub¬ 
ject  responded  with  a  CVC  and  the  computer  then  indicated  which  CVC  would  have  been  the 
correct  response  on  that  trial.  These  feedback  presentations  are  referred  to  as  the  "response"  CVCs 
by  Horst  et  al.  The  P300  elicited  by  these  CVCs  was  their  object  of  study. 

As  the  pairings  of  the  CVCs  was  entirely  arbitrary,  the  subject  had  to  learn  the  correct  response  by 
observing  the  response  CVCs.  Initially,  the  subject's  own  responses  were  almost  entirely  erroneous, 
but  as  the  trials  progressed,  subjects  did  learn  the  pairings,  and  their  responses  became  almost 
entirely  accurate.  Thus,  during  the  early  phases  of  learning  the  response  CVC  almost  always  indi¬ 
cated  that  the  subject  had  erred,  whereas  in  the  latter  sages  the  response  almost  always  indicated 
that  the  subject  was  right.  Note  that  no  particular  CVC  was  presented  frequently,  yet  the  absolute 
probability  of  a  given  CVC  did  not  predict  its  amplitude.  One  might  be  tempted  to  predict  that  the 
P300  would  vary  with  the  degree  to  which  the  CVC  indicated  whether  or  not  the  subject  had  erred, 
but  this  was  not  the  case.  It  turns  out  that  the  amplitude  of  the  P300  was  determined  by  the  interac¬ 
tion  between  the  subject’s  confidence  in  the  correctness  of  the  response  and  the  actual  outcome  of 
the  trial. 

Horst  et  al  required  the  subjects  to  report  their  confidence  before  each  response  CVC  appeared.  The 
confidence  ratings  were  analyzed  in  great  detail  and  were  normalized  so  that  each  subject’s  use  of 
the  confidence  scale  was  taken  into  account  in  the  analysis.  The  results  were  summarized  as  follows: 

"Our  dau  indicate  that  the  amplitude  of  the  P300  elicited  by  the  outcome  and  the  subject's 
expectancy  concerning  this  outcome.  Neither  confidence  by  itself  nor  whether  the  “response" 
CVC  confirmed  or  discontinued  the  subject’s  three-letter  response  accounted  for  the  variance 
in  P300.  Rather,  P300  amplitude  depended  on  the  degree  to  which  the  outcome  of  each  trial 
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was  unexpected.  The  lower  the  subjective  probability  assigned  to  the  outcome,  the  larger  was 
the  elicited  P300.  These  data  thus  strengthen  the  claim  that  P300  amplitude  is  dependent  on  the 
subjective  probability  associated  with  the  ERP -eliciting  event,  (p.  483)" 

Note  that  the  term  "unexpected"  is  defined  operationally  in  terms  of  the  discrepancy  between  the 
suojects'  reports  of  their  confidence  and  the  actual  outcome  of  the  trial.  The  observed  relationship 
between  P300  amplitude  and  the  interactive  relationship  between  outcome  and  confidence  is  an 
empirical  finding  whose  validity  depends  on  how  competently  the  experiment  was  conducted.  Once 
we  speak  of  '’subjective  probability,''  which  in  the  papers  published  by  Donchin  and  his  colleagues 
between  1970  and  1980  was  taken  to  be  synonymous  with  ’’expectancy,"  we  enter  the  realm  of 
interpretation.  The  term  "expectancy"  was  defined  by  Squires,  K.C.,  Wickens,  C.  D.,  Squires,  NJC 
and  Donchin.  E.  (1976)  as  referring  to  a  hypothetical  construct  that  accounted  for  their  observation 
that  even  though  the  probability  of  events  predicted  the  amplitude  of  P300  when  measured  from  the 
ERP  averaged  over  trials,  there  was  substantial  variability  in  the  amplitude  across  trials.  Much  of 
this  variance  (78%)  could  be  accounted  for  by  assuming  that  P3 00  is  an  inverse  function  of  "expec¬ 
tancy"  and  that  this  expectancy  varies  according  to  a  three  component  model  which  combines  an 
exponentially  decaying  memory  with  the  prior  probability  of  events  and  with  the  effect  of  alternat¬ 
ing  sequences.  A  critical  implication  of  this  model  has  been  that  it  is  the  "subjective"  rather  than  the 
prior  probability  that  determines  the  amplitude  of  P300.  It  is  to  test  this  hypothesis  that  Horst  et  al 
measured  the  interaction  between  subjects’  confidence  and  actual  events.  They  present  their 
interpretation  of  the  results  as  follows: 

"Our  notion  of  subjective  probability  implies  that  subjects  apply  their  knowledge  about  a  given 
situation  to  form  differential  expectancies  (subjective  probabilities)  for  the  various  events  that 
might  occur.  These  expectancies,  being  derived  from  external  information  that  is  filtered  by 
subjects’  perceptual  biases,  stored  in  a  fallible  memory,  and  tainted  by  the  individual's  predi¬ 
lections,  are  "subjective"  in  that  they  need  not  accurately  reflect  the  objective  probabilities  with 
which  events  occur.. .Information  processing  triggered  by  the  occurrence  of  an  event  is  affected 
by  the  expectancy  associated  with  that  event.  An  aspect  of  the  processing  invoked  by  unex¬ 
pected  events  is  reflected  in  P300  amplitude  p.  484)." 

We  dwell  on  the  Horst  et  al  study  because  it  receives  detailed  attention  from  Verieger,  who  begins 
by  stating  that: 

"Another  line  of  evidence  frequently  quoted  in  support  of  the  claim  that  P3  reflects  surprise  is 
that  disconnrming  feedback  evoked  larger  P3$  than  confirming  feedback  (Squires  et  al  1973: 
Horn  et  al  1980;  De  Swan.  Kok  and  Das  Smaal,  1981).  p.  12” 

This  is  a  very  partial  and  therefore  quite  misleading  summary  of  all  three  studies.  In  the  case  of 
Horst  et  al  (1980),  it  is  patently  incorrect,  because  these  investigators  clearly  denied  an  exclusive 
reladoaship  between  the  status  of  the  stimulus  as  a  confirming  or  discontinuing  event.  Rather,  they 
attempted  to  measure  directly  the  subject's  expectations  for  confirmations  and  disconfirmadons  and 
reported  that  both  confirming  and  discontinuing  feedback  will  at  times  elicit  a  large  P300  and  at 
other  rim**  a  small  one  depending  on  the  subject's  level  of  confidence.  Horst  et  al’s  analysis  is  quite 
detailed.  It  may  not  be  entirely  persuasive  and  it  is  always  possible  that  there  are  flaws  in  it  and  in 
the  assumptions  that  underlie  it.  We  would  welcome  a  valid  critique  of  these  analyses,  but  that  is  not 
what  Verleger’s  erroneous  description  seems  to  represent 

In  a  subsequent  reference  to  Horst  et  al  (1980),  Verieger  uses  a  strategy  be  employs  elsewhere  in  his 
target  article,  ^  ******  0f  dealing  with  the  reported  data,  he  surmises  what  subjects  may  have  ima¬ 
gined  while  participating  in  the  experiment  His  language  is  instructive,  as  it  illustrates  the  level  of 
his  critical  analysis  of  empirical  data: 

"In  the  Horst  et  al  study,  four  of  the  six  subjects  were  experienced,  and  all  subjects  were 
instructed  *We  want  to  correlate  your  confidence  rating  with  your  brain  waves.’  In  conclusion. 
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subjects  could  easily  have  imagined  or  even  known  exactly  which  outcomes  were  the  object  of 
the  study,  so  that  they  expected  those  outcomes  with  greater  interest,  p.  13" 

This  interpretation  is  clearly  at  variance  with  the  data.  It  is  entirely  unclear  how  the  subjects’  imagi- 
nation  could  have  controlled  the  outcomes  with  such  precision  when  the  subjects  could  not  possibly 
know  until  they  learned  the  correct  pairings  which  outcomes  to  expect  Perhaps  Verleger  is  implying 
here  that  the  subjects  were  more  interested  in  outcomes  that  violated  their  predictions,  but  this  is 
unlikely  because  the  outcome  per  se  did  not  determine  the  P300  amplitude.  Rather,  it  was  the 
interaction  between  outcome  and  confidence  that  was  the  controlling  variable.  Furthermore,  Ver¬ 
leger  fails  to  deal  seriously  with  the  very  detailed  trial  to  trial  analysis  conducted  by  Horn  et  aL  He 
likewise  ignores  the  complex  pattern  of  results  reported  by  Squires  et  al  (1973),  focusing  instead  on 
the  fact  that  the  experimenters  served  as  subjects.  He  also  fails  to  take  into  account  the  very  many 
replications  of  the  signal  detection  work  which  followed  that  seminal  experiment  and  which  repeat¬ 
edly  found  the  pattern  of  results  (see,  for  example,  Campbell  et  al  1979*,  Hillyard  et  al  1971). 

3.2.4.  The  role  of  Expectancy 

It  would  appear  then  that  the  effect  of  "expectancy1'  on  the  amplitude  of  the  P300  cannot  be  denied 
if  "expectancy"  operationally  defines  describes  the  conditions  of  stimulation  and  their  interactions 
with  the  instructions  to  the  subject  This  does  not  mean  that  the  relationship  will  hold  for  every  con- 
strual  of  the  tenn  “expectancy"  as  a  psychological  construct.  Such  consnuals  must  be  made  in  the 
constraints  of  a  psychological  theory  and  their  validity  has  to  be  tested  by  their  predictive  power 
rather  than  by  their  apparent  post-hoc  plausibility  or  implausibility. 

It  is  also  important  to  emphasize  that  at  no  time  did  we  assert  that  the  subjective  probability  associ¬ 
ated  with  events  is  the  sole  determinant  of  P300  amplitude.  Indeed,  it  became  evident  quite  early 
(Donchin  and  Cohen  1967)  that  whereas  to  a  first  approximation  P300  amplitude  was  strongly 
affected  by  the  rarity  of  the  eliciting  event,  it  was  not  possible  to  account  for  all  its  variance  merely 
by  specifying  the  prior  probability  of  the  eliciting  stimuli  The  need  to  go  beyond  "objective”  or 
prior  probabilities  became  evident  with  the  discovery  of  the  effects  of  the  preceding  stimulus 
sequence  on  the  P300,  first  reported  by  Squires  et  al  (1976). 

As  Verleger  pays  much  attention  to  these  data,  it  is  worth  reminding  the  reader  what  Squires  et  al 
reported:  The  amplitude  of  P300.  when  examined  in  a  Bernoulli  series  governed  by  prior  probability 
P,  varies  as  a  function  of  the  sequence  of  stimuli  presented  on  the  preceding  trials.  When  plotted  as 
a  "tree."  these  sequential  effects  mimicked  the  sequential  effects  often  observed  in  reaction  times 
(Faimagne  1965;  Audley  1973).  Squires  et  al  suggested  that  the  variable  which  controls  the  ampli¬ 
tude  of  P300  is  the  subjective  probability,  rather  than  the  prior  probability  of  events.  They  proposed 
that  this  subjective  probability,  or  "expectancy,"  varies  as  a  function  of  a  decaying  memory  for  each 
stimulus  in  the  series,  the  global  probability  P  and  an  "alternation"  factor  that  was  required  to 
account  for  unique  responses  to  stimuli  that  immediately  followed  an  alternating  pattern.  The  model 
has  proved  quite  powerful  and,  as  Verleger  notes,  the  results  have  been  widely  replicated. 

Squires.  Petuchowsky,  S.,  Wickens,  C  D.  and  Donchin,  E.  (1977),  demonstrated  that  the  sequential 
effects  appear  in  both  visual  and  auditory  oddball  series  and  that  the  model  presented  by  Squires  et 
al  (1976)  accounts  for  the  results  in  both  modalities  provided  one  takes  into  account  their  differen¬ 
tial  decay  rates.  Duncan- Johnson  and  Donchin  (1977)  reported  that  sequence  effects  are  additive 
with  the  effects  of  prior  probability.  Johnson  and  Donchin  (1980)  demonstrated  that  when  the  sub¬ 
ject  counts  one  of  three  equally  probable  stimuli  the  two  uncounted  stimuli  enter  the  sequence  with 
the  probability  of  their  category  (Le.  P( uncounted}  ■  .66).  We  could  multiply  such  details  from  our 
Laboratory  and  those  of  others.  The  story  that  emerges  is  rather  complex,  but  the  complexity  in  no 
way  justifies  Verleger’s  assertion  that  "Limits  were  reported  on  the  validity  of  these  findings  (p^  4)" 
Perhaps  Verleger  is  not  quite  precise  in  his  use  of  the  word  "validity."  If  he  means  "generality,"  we 
are  surely  in  agreement.  Gearly,  the  sequential  effects  will  be  observed  only  if  the  circumstances 


are  appropriate  and  it  is  very  important  to  specify  these  limits.  However,  the  existence  of  these  Lim¬ 
its  does  not  reflect  on  the  validity  of  the  findings. 

To  consider  one  example  of  such  limits:  Verieger  cites  the  finding  by  Ouncan-Johnson  and  Donchin 
(1982)  that  "sequence  effects  were  eliminated  when  a  warning  signal  was  interspersed  that  allowed 
for  predicting  the  next  stimulus  above  chance  level  p.  6)."  This  is  not  quite  an  accurate  report  of 
Duncan-Johnson  and  Donchin 's  results.  The  study  presented  subjects  with  an  oddball  task  in  which 
each  of  the  stimuli  in  the  series  was  preceded  by  a  warning  stimulus  in  all  experimental  conditions. 
However,  in  some  series  the  warning  stimulus  provided  no  information  about  the  probability  with 
which  the  imperative  stimulus  on  that  trial  would  occur.  These  series  are  contrasted  with  series  in 
which  each  warning  stimulus  provided  information  about  the  probability  distribution  of  the  impera¬ 
tive  stimuli.  In  other  words,  in  some  senes  the  sequence  of  stimuli  was  the  subjects'  sole  source  of 
information  about  the  probability  distnbubons;  in  others  the  sequence  carried  no  such  information 
because  the  distribudon  was  determined  afresh  by  each  warning  stimulus  on  each  trial.  Note  that  the 
structure  of  the  sequence  and  the  pnor  probabilities  (over  the  entire  series)  were  the  same  in  both 
series. 

The  experiment  in  question  was  designed  to  test  the  hypothesis  that  the  sequential  effects  are 
observed  because  the  subjects  are  extracting  information  about  the  probability  of  events  in  the 
environment  from  the  preceding  sequence.  The  data  demonstrated  that  stimuli  do  indeed  affect  the 
P300  elicited  by  a  succeeding  event  if,  and  only  if,  the  subject  has  no  other  source  of  information 
about  the  probabilities.  This  appeared  to  support  rather  nicely  the  model  presented  by  Squires  et  al 
( 1976),  and  enhanced,  rather  than  cast  doubt  on,  the  validity  of  the  basic  observations  on  the  sequen¬ 
tial  effects.  Furthermore,  the  results  are  quite  consistent  with  the  suggestion  that  the  P300  is  a  man¬ 
ifestation  of  a  process  involved  in  the  maintenance  of  a  model  of  the  environment.  The  "flexibility" 
of  the  effect  can  be  construed  as  damaging  to  the  context  updating  model  only  in  the  framework  of 
the  model  which  Verieger  labels  "Update."  If  P3QQ  requires  (as  Verieger  implies  we  suggested)  the 
updating  of  specific  traces  in  a  buffer  •  an  updating  with  no  relationship  to  a  larger  purpose  •  then 
indeed  the  relationship  should  show  no  flexibility.  But,  as  will  be  seen  below,  and  as  should  be  evi¬ 
dent  from  the  literature,  "Update"  Is  really  just  a  caricature  of  our  model. 

3  J.  Task  Relevance 

Even  though  the  effect  of  subjective  probability  discussed  is  powerful  and  has  received  a  prominent 
share  of  investigative  attention  it  was  quite  evident  from  the  earliest  studies  that  at  least  one  other 
factor  must  be  taken  into  account,  namely,  "task  relevance."  The  simplest  demonstration  of  this  fact 
is  the  observation  that  no  P300  is  elicited  when  subjects  "ignore"  the  very  same  series  of  events 
whose  rare  members  elicit  a  large  P’300  when  they  attend  to  the  events.  For  example,  in  the  case  of 
the  the  ERPs  elicited  by  the  stimuli  used  by  Duncan-Johnson  and  Donchin  (1977)  when  their  sub¬ 
jects  were  solving  a  word  puzzle,  virtually  no  P300s  are  elicited  by  the  rare  events.  Yet  these  very 
same  stimuli,  when  counted,  elicit  large  P300s.  As  Verieger  states  by  way  of  a  critique.  There  is  a 
target  effect  (p.  6)."  On  this  we  agree.  But  what  is  difficult  to  see  is  how  this  observation,  which  we 
consider  a  critical  dement  in  the  evidence  for  the  context  updating  model,  is  used  by  Verieger  in 
support  of  his  negative  view. 

The  sensitivity  of  P300  amplitude  to  the  allocation  of  the  subject’s  resources  to  the  task  in  which  the 
eliciting  stimuli  are  embedded  has  been  examined  by  Donchin  and  his  colleagues  in  an  extensive 
series  of  studies  attempting  to  develop  a  measure  of  mental  workload  based  on  the  amplitude  of 
P300  (Donchin  et  al  1986a:  Gopher  and  Donchin  1986).  The  strategy  in  most  of  these  experiments 
has  been  to  require  subjects  to  perform  the  oddball  task  concurrently  with  another  "primary"  task 
whose  demands  on  their  aoentional  resources  could  be  varied.  The  results,  most  of  which  are 
ignored  by  Verieger.  have  been  rather  consistent.  The  mote  demanding  the  primary  task,  the  lower 
die  amplitude  of  P300  elicited  by  the  concurrently  presented  oddball  stimuli  (see  also  Ro"sSct  1983). 
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Here  again,  upon  careful  examination  the  partem  of  results  proves  rather  specific.  Not  every 
increase  in  demand  by  the  primary  task  has  an  effect  on  the  P300  that  is  elicited  by  secondary  task 
stimuli.  Isreai,  Wickens.  Chesney  and  Donchin  (1980),  have  shown  that  to  be  effective  the  demands 
must  be  concentrated  in  the  perceptual  domain.  At  least,  it  was  clear  from  their  data  that  increasing 
the  motor  demands  of  the  primary  task  does  not  affect  P30G.  Kramer,  Wickens  and  Donchin  (1985) 
examined  the  nature  of  the  resources  which  must  be  in  demand  for  an  effect  to  be  observed;  they 
have  shown  that  the  interactions  can  be  controlled  by  manipulating  the  relationship  in  visual  space 
between  the  stimuli  associated  with  the  two  tasks. 

The  general  implication  of  these  data  is  that  the  processing  of  which  P300  is  a  manifestation  is  sen¬ 
sitive  to  the  use  of  a  limited  capacity  system  which  can  be  deployed  in  the  service  of  one  task  or 
another.  Strong  evidence  in  support  of  this  suggesnon  was  provided  by  Wickens,  Kramer,  Vanasse 
and  Donchin  (1983)  see  also  Sirevaag,  Kramer,  Coles  and  Donchin  1986),  who  measured  con¬ 
currently  the  P300s  associated  with  both  the  primary  and  the  secondary  task.  It  turns  out  that  as  the 
difficulty  of  the  primary  task  is  increased  the  amplitude  of  these  P300s  changes  in  a  reciprocal 
fashion.  That  is,  whereas  P300s  associated  with  the  secondary  task  decrease  in  amplitude,  the  P300s 
associated  with  the  primary  task  increase.  The  sum  of  the  two  concurrently  recorded  P300s  was  con¬ 
stant  over  the  range  of  difficuldes  used  for  the  primary  task. 

As  the  data  have  been  reviewed  in  detail  elsewhere,  and  as  they  received  but  little  of  Verleger's 
attention,  we  will  not  examine  the  effect  of  task  relevance  in  further  detail  Suffice  it  to  note  that  one 
cannot  provide  a  satisfactory  account  of  the  antecedent  condidons  for  P300  without  considering  task 
relevance.  An  important  analysis  of  the  interacdon  between  probability  and  task  relevance  has  been 
provided  by  Johnson  (in  press),  who  reviews  the  evidence  for  a  muldplicadve  relationship  between 
these  variables.  Johnson’s  review  highlights  the  fact  that  task  relevance  operates  only  to  the  extent 
that  the  subject  extracts  the  relevant  information  from  the  stimulus.  He  suggests  that  the  meaningful¬ 
ness  of  the  stimulus  -  chat  is,  its  task  relevance  defined  externally  to  the  subject  ->  can  only  operate 
to  the  extent  tnat  the  subject  extracts  the  necessary  data.  This  is  an  extension  of  Ruchltin  and 
Sutton's  (1978)  observation  that  "equivocation  "  will  reduce  the  amplitude  of  P300. 

This  well  taken  point  notes  that  it  is  "subjective''  task  relevance,  just  as  it  is  "subjective"  probability, 
that  is  the  critical  variable.  In  other  words,  the  locus  of  the  effects  we  are  studying  is  the  subject’s 
mind  (or  brain)  rather  than  the  apparatus  that  generates  the  experimental  conditions.  Probability  is 
filtered  by  the  subject's  concept  of  the  probability  of  events  and  informadon  is  effective  only  to  the 
extent  that  it  is  retrieved,  processed  and  used.  This  view  is  highly  consistent  with  our  view  of  P300 
as  the  manifestation  of  processing  which  is  very  sensitive  to  the  cognitive  operations  elicited  by 
events  when  subjects  perrrom  certain  asks.  Any  view  as  narrow  as  "Update."  or  "Expect  "  in  the 
form  enunciated  by  Verleger  is  necessarily  implausible. 

3.4.  The  Latency  of  P300 

Whereas  the  information  about  the  effects  of  probability  and  ask  relevance  that  we  discussed  above 
played  a  critical  role  in  the  development  of  our  interpretation  of  the  P300,  a  dif  erent  body  of  data, 
one  to  which  Verleger  pays  little  attention,  served  as  the  key  element  in  the  development  of  the  con¬ 
text  updating  model.  This  is  an  extensive  literature  concerned  with  the  latency  of  the  P300.  The 
interest  in  this  problem  was  triggered  by  the  obvious  dissociation  between  reaction  time  and  P300 
latency.  Here  again,  the  empirical  data  are  clear,  but  their  interpretation  has  been  controversial. 

In  1987  it  may  be  difficult  to  recall  that  the  dissociation  between  the  latency  of  the  P300  anti  reac¬ 
tion  time  was  at  one  time  considered  a  major  impediment  to  the  interpretation  of  the  component. 
Donchin  et  al  (1978),  Donchin  (1984)  and  Tueting  (1978)  review  the  issue.  There  were  two  central 
concerns.  First,  conflicting  data  have  been  presented  about  the  correlation  between  P300  latency 
and  RT.  Ritter  et  al  (1972),  showed  quite  convincingly  that  the  two  variables,  when  taken  on  a  trial 
by  trial  basis,  are  positively  correlated.  Other  investigators,  however,  failed  to  observe  the 
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correlation. 

A  second  problem  was  the  general  expectation  that  P300  would  precede  the  reaction.  This  expecta¬ 
tion  was  based  on  the  notion  that  processing  begins  with  a  stimulus  and  ends  with  a  response.  If 
P300  (so  went  the  argument)  is  a  "correlate '  of  information  processing,  it  must  take  place  before  the 
response,  or  else  one  would  have  to  say  that  it  cannot  reflea  processing,  because  all  processing  was 
presumed  to  precede  the  response. 

Both  of  these  difficulties  led  Donchin  (1979,  see  also  Tuedng  1978),  to  suggest  that  it  may  be  best 

to: 

"postulate  the  existence  of  multiple,  parallel  processors,  continually  active.  Stimuli  impinge  on 
this  stream  of  processing  and  modulate  and  activate  in  different  ways  this  complex  matrix. 
Multiple  responses  are  evoked  by  a  given  stimulus.  Some  with  immediate  consequences,  others 
will  not  be  manifested  for  some  time,  exercising  their  effea  through  changes  in  the  subjea’s 
strategies  rather  than  through  their  effects  on  the  immediate  responses.  If  P300  is  associated 
with  processes  that  are  more  related  to  strategic  rather  than  to  tactical  information  processing, 
then  the  degree  to  which  it('s  latency)  correlates  with  RT  will  depend  on  the  strategies  the  sub- 
ject  tend  to  adopt  A  dissociation  between  RT  and  P300  latency  becomes,  in  this  context  far 
more  interesting  than  associations  between  these  variables.  (Donchin  1984,  p.  114)" 

These  assertions  were  presented  concurrently  with  evidence  reported  by  Kutas,  McCarthy  and  Don¬ 
chin  (1977)  that  the  correlation  between  P30G  latency  and  RT  varies  with  the  level  of  accuracy 
demanded  of  a  subject  This  result  is  hardly  surprising.  If  P300  is  elicited  by  an  event  as  a  function 
of  the  category  to  which  that  event  is  assigned  then  P300  will  be  elicited  only  after  categorization 
and  this  latency  would  be  proportional  to  categorization  time.  The  oven  reactions  whose  time  is 
measured  may  or  may  not  be  elicited  on  the  baas  of  a  full  analysis  of  the  stimulus.  The  prevalence 
of  fastguesses  when  speed  of  reaction  is  stressed  is  well  known. 

Thus,  the  variance  in  RT  is  affected  by  many  factors.  Only  a  subset  of  these  factors  affects  the 
latency  of  P300.  A  striking  confirmation  was  provided  by  Renault  and  his  coworkers  (Ragoc  and 
Renault  1981).  who  found,  in  a  choice  RT  task,  that  when  subjects  respond  with  their  hands  crossed 
their  RTs  are  greatly  increased.  This  source  of  a  substantial  portion  of  the  variance  in  RT  had  no 
effect  whatsoever  on  the  latency  of  the  P300.  These  data  confirmed  the  proposal  made  by  Kutas  et  al 
(1977)  and  by  McCarthy  and  Donchin  (1981)  that  P300  latency  is  insensitive  to  variables  related  to 
the  execution  of  the  response. 

The  pattern  of  the  relationship  between  RT  and  P300  latency  has  been  reviewed  in  detail  elsewhere 
(see  Hill  yard  and  Kutas  1983;  Duncan*  Johnson  and  Donchin  (1977),  Donchin  et  al  (1986b);  and  for 
more  recent  work  extending  (his  approach  and  combining  it  with  studies  of  response  preparation, 
see  Coles  and  G ration  1986;  Gratton,  Coles,  Sixevaag,  Eriksen  and  Donchin,  in  press).  For  our 
present  purpose  it  is  sufficient  to  note  one  critical  implication  of  these  data,  namely,  that  the  pro¬ 
cessing  manifested  by  the  P300  is  apparently  not  necessary  for  the  execution  of  the  oven  responses 
the  subject  must  make  to  the  stimulus.  On  occasion,  especially  when  subjects  are  accurate,  the 
responses  are  delayed  with  res  pea  to  P300  and  the  correlation  between  P300  latency  and  RT 
increases  (see  Kutas  et  al  1977  as  well  as  Coles,  Gratton,  B ashore,  Eriksen  and  Donchin,  1985).  In 
other  words,  when  the  variance  in  RT  is  shared  with  the  variance  in  P300  latency,  the  correlation 
between  these  two  measures  will  be  positive  and  fairly  large.  But,  in  the  main,  a  dissociation 
between  these  two  measures  is  observed  frequently,  suggesting  that  oven  responses  can  be  emitted 
without  awaiting  the  processing  manifested  by  the  P300.  It  is  these  data  that  suggest  most  power¬ 
fully  that  the  P300  is  a  manifestation  of  processing  used  in  maintaining  our  model  of  the  environ¬ 
ment 
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4.  The  context  updating  Model 
4.1.  Summary  of  preceding  argument 

The  task  we  undertake,  as  we  outline  the  context  updating  model  has  been  well  summarized  by 
Ro'sler  (1983). 

'To  make  progress  towards  finding  an  exact  definition  of  the  functional  state  indicated  by 
P300,  one  has  to  do  something  more  than  just  manipulate  input,  context  or  output  variables. 
Therefore,  the  additional  step  is  to  abstract  from  the  directly  observed  variables  and  the  partic¬ 
ular  experimental  settings  one  common  'denominator',  an  intervening  variable  processing  con¬ 
struct,  which  can  be  taken  as  the  possible  if  one  relies  on  another  source  of  information  besides 
che  directly  observable  facts.  One  must  add  some  assumptions  about  the  information  process¬ 
ing  activities  performed  by  the  brain;  in  short,  one  must  formulate,  mote  or  less  explicitly,  a 
cognidve  theory,  (p.  IT)" 

In  the  immediately  preceding  sections  of  this  precommemary  we  outlined  the  "directly  observable 
facts.  "  Before  we  outline  the  "cognidve  theory  "  that  may  account  for  these  facts  we  briefly  summar¬ 
ize  chose  pieces  of  the  puzzle  we  consider  crucial  to  the  evoludon  of  our  model 

(1)  The  amplitude  of  P300  is  controlled,  in  a  muldplicadve  fashion,  directly  by  the  task-relevance 
of  the  eliciting  event  and  inversely  by  the  subjective  probability  of  the  event.  We  note  here  that 
even  though  Vcrieger  argues  that  "P3s  are  not  evoked  by  unexpected  stimuli,"  his  argument 
consists  of  a  conjecture  about  the  psychological  state  of  the  subject.  That  is,  Verleger  is  argu¬ 
ing  that  rather  than  being  surprised  by  the  eliciting  stimuli  the  subject  is  actually  awaiting  those 
stimuli  There  is  very  little  evidence,  however,  that  would  support  speculations  about  the  sub¬ 
jects’  state  of  mind  at  the  time  the  eliciting  stimuli  are  presented.  The  closest  any  one  has  come 
to  directly  measuring  the  subjects’  psychological  states  in  these  experiments  have  been  the 
assessment  of  their  confidence  in  their  predictions,  guesses  or  judgments  (e.g.  Hill  yard  et  al 
1971;  Horst  et  al  1980)  and,  as  we  noted  above,  these  suggest  that  surprising  events  do  elicit  a 
larger  P300.  However,  whatever  one  concludes  with  respect  to  this  issue,  there  is  hardly  room 
for  doubt  concerning  the  strictly  empirical  asserdon  that,  in  a  range  of  experimental  parame¬ 
ters.  reducing  the  probability  of  an  event  will  increase  the  amplitude  of  the  P300  elicited  by  the 
event.  This  effect  of  probability  is  tempered  by  its  multiplicative  interacuon  with  the  effect  of 
task-relevance  (Johnson,  in  press). 

(2)  Overt  responses  required  of  the  subject  on  a  given  trial  can  be  emitted  prior  to  the  appearance 
of  the  P300.  If  the  response  reflects  a  discrimination  among  stimuli  then  responses  emitted 
prior  to  the  appearance  of  the  P300  are  more  likely  to  be  inaccurate.  The  correlation  between 
the  latency  of  the  P30Q  and  the  RT,  computed  over  (rials,  is  not  necessarily  significant  Thus, 
different  factors  appear  to  control  the  variance  in  these  two  dependent  measures.  In  other 
words,  the  P300  is  relatively  decoupled  from  the  motor  output  the  subject  must  generate  in 
response  to  the  eliciting  event 

(3)  The  P300  seems  to  manifest  activity  in  a  limited  capacity  system  whose  use  in  the  service  of 
different  casks  is  under  relative  control  by  instructions.  There  is  evidence  that  the  total  ampli¬ 
tude  of  P300  in  the  presence  of  concurrent  tasks  is  constant 

(4)  The  amplitude  of  P300  seems  to  be  sensitive  to  the  intervals  between  successive  stimuli  One 
can  account  for  the  sequential  effects  in  P300  amplitude  by  assuming  that  it  is  inversely  pro¬ 
portional  to  the  decay  of  the  representation  established  by  preceding  occurrences  of  the  elicit¬ 
ing  events.  Furthermore,  the  ubiquitous  effect  of  probability  on  the  amplitude  of  the  P300  in  an 
oddball  paradigm  is  drastically  reduced,  if  not  eliminated,  when  the  interval  between  events  is 
increased  (Heffley,  Wickens  and  Donchin,  1978;  Fitzgerald  and  Picton  1981). 


4  2.  The  Model 

As  we  proceed  to  develop  a  "cognitive"  theory  to  identify  the  processing  function  manifested  by  the 
P300  taking  the  above  into  account,  we  find  the  dissociaQon  between  P300  latency  and  RT  to  be  the 
most  compelling  piece  of  the  puzzle.  It  is  a  particularly  important  datum  because  it  eliminates  from 
consideration  the  rather  large  class  of  information  processing  activities  involved  in  selecting  and 
executing  the  oven  response  to  the  eveot  which  elicits  the  P300.  The  idea  at  the  core  of  the  context 
updating  model  is  that  P300  is  invoked  in  the  service  of  "strategic''  rather  than  "tactical”  information 
processing  (Donchin  et  al  1978).  According  to  the  strategic/tactical  distinction  information  is  pro¬ 
cessed  in  parallel  interacting  streams.  The  "tactical"  stream  is  concerned  with  the  processes  that 
select  a  specific  set  of  responses  at  any  instance,  that  is.  it  controls  oven  behavior  in  an  immediate 
sense:  actions.  The  'strategic''  stream  is  involved  in  the  planning  and  control  of  behavior,  such 
things  as  the  long  tens  setting  of  priorities,  the  setting  of  biases,  the  mapping  of  probabilities  on  the 
environment  or  the  deployment  of  attention.  These  are  merely  illustrations  of  the  vast  realm  of 
metacontrol  processes  that  must  be  implemented  as  the  infrastructural  support  of  overt  behavior. 
The  specific  physical  events  occurring  on  a  trial,  be  they  stimuli  or  responses,  serve  as  data  for  this 
scream  of  processing.  We  suggest  that  the  P300  is  a  manifestadon  of  processes  in  the  strategic 
stream. 

Note  that  the  evidence  pointing  to  such  a  meta-control  role  for  the  P300  is  strong  even  though  it 
yields  no  clue  to  the  specific  function  of  the  process  of  which  the  P3Q0  is  a  byproduct.  The  key  to 
the  puzzle  contributed  by  the  latency  data  is  that  it  directs  our  attention  to  processes  which  are  not 
critical  to  the  immediately  executed  response.  Three  aspects  of  the  data  supply  this  clue.  First,  the 
latency  of  P300  often  exceeds  the  RT.  The  obvious  conclusion  is  that  the  process  it  manifests  is  not 
a  necessary  condition  for  the  emission  of  the  response.  Second,  the  coiTeladon  between  P300 
latency  and  RT  depends  on  the  level  of  accuracy  the  subject  is  striving  to  achieve.  This  suggests  that 
the  elicitadon  of  P300  depends  on  a  more  extensive  extraction  of  informadon  about  the  event  and  on 
the  processing  of  that  informadon.  Finally,  we  note  that  the  latency  of  the  P300  appears  to  be  unaf¬ 
fected  by  processes  associated  with  the  execudon  of  the  response  (McCarthy  and  Donchin  1981). 
This  again  suggests  that  the  function  manifested  by  the  P30Q  is  associated  with  the  evaluadon  of 
stimuli 

When  we  use  such  words  as  "associated"  or  "reflects"  in  referring  to  the  relationship  between  P300 
and,  say,  "stimulus  evaluation,"  we  do  not  intend  to  imply  that  the  P300  is  in  fact  a  byproduct  of 
stimulus  evaluadon.  Such  a  suggestion  would  be  inherently  invalid  because  stimulus  evaluation  is 
not  a  unitary  processing  entity  but  a  multiplicity  of  processes.  The  data  on  P300  latency  allow  a 
choice  between  two  classes  of  processing  functions  -  those  involved  in  the  decoding  and  assessment 
of  information  and  those  associated  with  the  execution  of  oven  responses.  The  P300  seems  to  be 
independent  of  the  latter  category.  It  seems  reasonable  to  assume  that  its  function  once  identified 
will  belong  to  the  former  category,  which  we  loosely  label  "stimulus  evaluation."  Additional  sup¬ 
port  for  this  view  comes  from  the  studies  of  the  behavior  of  P30G  in  dual-task  studies.  Isreal  et  al 
(1980),  as  well  as  Kramer,  Wickens  and  Donchin  (1983),  have  reported  that  to  be  effective  in  reduc¬ 
ing  the  amplitude  of  the  P300  elicited  by  secondary  task  stimuli  the  load  of  the  primary  task  must  be 
increased  in  the  perceptual  rather  than  the  motor  domain.  Thus,  the  general  class  of  processes  of 
which  the  P300  is  a  manifestation  probably  involves  a  perceptual  evaluation.  The  specific  selection 
in  this  category,  however,  must  be  made  on  the  basis  of  additional  data.  Once  we  accept  P300  to  be 
associated  with  "strategic,"  future  oriented,  processing  we  must  seek  a  more  precise  definition  of  the 
specific  process. 

It  is  at  this  point  that  the  fact  that  the  P300  is  often  associated  with  events  whose  probability  is  low 
begins  to  affect  our  thinking.  What  function  could  one  reasonably  ascribe  to  a  processor  activated 
by  rare  events,  one  whose  amplitude  is  directly  proportional  to  rarity?  Two  assumptions  enter  our 
interpretation  of  these  observations.  Both  are  tenuous  and  an*,  justified  largely  by  their  plausibility 
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and  their  heuristic  value.  The  first  assumption  is  the  weakest  link  in  our  entire  structure.  It  is  that  the 
amplitude  of  the  P300  is  a  measure  of  the  extent  to  which  the  processor  manifested  by  the  P3Q0  is 
activated,  or  ‘'utilized "  This  assumpaon  is  basic  to  the  ERP  literature  and  Verleger  appears  to 
accept  it.  It  must  be  admitted,  however,  char  we  have  no  direct  physiological  evidence  for  this  or  any 
other  interpretation  of  changes  in  the  amplitude  of  the  P300.  It  is  plausible  to  assume  that  the  larger 
the  potential  the  larger  the  voltages  generating  the  field  which  is  being  recorded  and  that  this  size  is 
a  measure  of  the  activation  of  the  tissue  cram  which  we  record.  This,  unfortunately,  is  not  neces¬ 
sarily  the  case  as  Allison  et  al  (1986)  point  out.  One  hopes  that  the  various  attempts  to  identify  the 
intracranial  sources  of  the  P300  will  provide  data  that  will  allow  an  interpretation  of  changes  in  the 
amplitude  of  the  P300  (e.g.  Halgrcn.  Stapleton,  Smith  and  Aitafuilah  1986;  Wood,  McCarthy, 
Squires,  Vaughan,  Woods,  and  McCall um  1984;  Okada,  Kaufman  and  Williamson  1983).  For  the 
tune  being,  however,  we  must  admit  that  the  assumption  is  entertained  primarily  for  its  heurisac 
value  and  its  merit  must  be  evaluated  according  to  its  utility  as  a  guide  for  future  research.7 

The  second  assumpaon  we  make  here  is  a  cognitive  one:  A  class  of  metacontrol  processes  is  con¬ 
cerned  with  maintaining  a  proper  representation  of  the  environment.  The  concept  of  a  "model  of  the 
environment''  is  of  course  not  new.  It  has  assumed  many  guises,  ranging  from  the  Sokolovian  "neu¬ 
ronal  model"  through  contemporary  notions  of  "schema"  and  "mental  models"  (Johnson-Laird 
1983).  [See  also  Kosslyn  et  ai.:  "On  the  Demystification  of  Mental  Imagery"  BBS  2(4)  1979  and 
Arbib:  "Levels  of  Modeling  of  Mechanisms  of  visually  guided  Behavior"  BBS  10(3)  1987.1  There 
are  very  meaningful  differences  among  ail  these  theoretical  concepts.  However,  they  have  one  gen¬ 
eral  idea  in  common,  namely,  that  pan  of  the  totality  of  representations  and  skills  potentially  avail¬ 
able  to  the  organism  at  any  rime  is  in  a  state  of  heightened  availability  and  that  this  high-availability 
segment  is  sensitive  to  the  conditions  of  the  environment  (the  "context")  and  to  the  needs  imposed 
by  the  tasks  at  hand. 

The  idea  that  a  model  of  the  environment  is  used  in  evaluating  incoming  information  and  in  select¬ 
ing  responses  is  quite  common  among  cognitive  theorists.  The  level  of  detail  with  which  such  con¬ 
cepts  are  presented  varies  and  the  specificity  with  which  models  are  articulated  is  also  quite  vari¬ 
able.  However,  it  is  really  not  critical  at  this  stage  to  specify  how  the  "contextual  model”  is  imple¬ 
mented.  Although  this  is  an  interesting  subject,  we  need  not  choose  among  the  many  candidates. 
The  only  assumption  we  need  to  make  is  that,  however  the  model  is  implemented,  there  must  be 
mechanisms  chat  will  maintain  it  as  an  accurate  model.  No  model  (or  schema)  would  be  of  any  use 
unless  it  was  possible  for  it  to  adjust  dynamically  to  the  environment.  When  the  "context"  changes, 
the  model  must  be  revised.  A  model  lacking  an  updating  component  would  be  useless  because  it 
would  fail  to  meet  its  most  basic  requirement,  namely,  that  it  should  reflect  the  ongoing  context.  As 
contexts  are  continually  in  flux,  the  model  must  undergo  constant  revisions.  Novelty,  surprise  and 
the  occurrence  of  improbable  events  must  somehow  be  integrated  into  it,  either  by  revising  its  map¬ 
ping  of  probabilities  on  the  environment  or  by  rejecting  the  importance  of  the  event  and  remaining 
unchanged. 

The  sensitivity  of  P300  to  the  probability  of  events  adds  plausibility  co  the  suggestion  that  it  is  asso¬ 
ciated  with  maintaining  the  schema.  Note  that  whether  the  decisive  fact  is  that  the  stimulus  is  "unex¬ 
pected"  or  is  that  it  is  associated  with  "closure"  makes  no  difference  at  this  sage  of  the  argument.  In 
either  case  we  mutt  hypothesize  a  functional  role  for  the  manifested  process.  We  suggest  that  the 
process  will  be  one  involved  in  the  maintenance  of  the  schema,  or  the  model,  of  the  current  context 
This  involves  no  commitments  about  how  this  goal  is  achieved;  it  leaves  the  details  of  the  operation 
unspecified.  We  note,  however,  the  importance  of  "task  relevance"  in  the  elicitation  of  the  P300. 
This  fact  must  constrain  speculations  about  the  underlying  processing. 


7  U  should  be  noted,  though,  that  the  mum  peon  does  gain  credence  bon  the  convergence  of  the  very  large  number 
of  conducted  under  us  guidsncc* 
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Q early,  not  all  improbable  changes  in  the  environment  lead  to  the  updating  of  the  context.  Only 
chose  segments  of  the  context  that  are  central  to  the  tasks  performed  by  the  subject  are  likely  to 
bring  about  the  changes  in  the  model  of  the  environment.  The  observed  changes  are  associated  with 
an  optional  limited-capacity  system.  The  constancy  of  the  amplitude  of  P3 00  across  tasks  as  the 
demands  of  the  tac^c  are  changed  reciprocally  (Wickens  et  al  1983)  is  strongly  supportive  of  the 
assodaaon  of  the  process  with  a  limited  capacity  system.  Two  studies  by  Johnson  and  Donchin 
(1978  1982)  illustrate  the  considerations  chat  enter  into  the  evoludoa  of  this  modcL 

Johnson  and  Donchin  (1978)  asked  subjects  to  press  a  button  exactly  one  second  after  a  light 
flashed  The  subject  was  informed  on  each  trial  by  one  tone  if  his  time  estimate  was  correct  and  by 
another  it  was  not.  The  intensity  difference  between  the  tones  was  the  independent  variable  in  the 
experiment.  It  turned  out  that  the  larger  the  difference  between  the  tones,  the  larger  the  P300  they 
elicited  The  very  same  tones,  when  used  in  an  oddball  task  (that  is,  when  the  subject  was  instructed 
to  count  the  number  of  tones  in  a  series),  elicited  a  P300  whose  amplitude  was  independent  of  the 
intensity  difference  between  them.  Johnson  and  Donchin  interpreted  these  data  as  indicating  that  the 
amplitude  of  P300  is  proportional  to  the  extent  that  the  stimuli  provide  the  subject  with  information 
that  can  help  in  the  performance  of  the  task.  However,  the  degree  to  which  the  information  is  util¬ 
ized  is  to  an  extent  under  the  subject's  control.  During  the  time  estimation  task  the  extraction  of 
information  from  the  tones  was  not  mandatory  and  the  information  was  less  likely  to  be  used  if  the 
discrimination  between  the  two  tones  placed  a  higher  load  on  the  subject.  When  directly  instructed 
to  count  tones  the  subjects  had  no  such  opoon  and  the  tones  were  used  regardless  of  their  discrimi- 
nability.  Within  the  framework  of  context  updating  we  would  suggest  that  the  subjects  maintained, 
as  pan  of  their  model  of  the  context,  a  basis  (a  template?)  against  which  they  estimated  dme.  When 
the  information  provided  by  the  tones  was  useful  in  maintaining  and  revising  this  model  the  tones 
were  so  used,  as  indicated  by  the  large  P300.  This  interpretation  was  bolstered  by  the  fact  that  the 
lower  the  P300,  the  more  errors  the  subjects  committed  and  the  less  likely  they  were  to  improve. 

In  a  subsequent  study,  Johnson  and  Donchin  (1982)  presented  subjects  with  an  oddball  series  in 
which  the  probabilities  were  changed  in  midcourse.  The  series  might  start  with  the  probability  of  the 
high  tone  at  .33  and  then,  in  40  to  80  trials,  the  probabilities  of  the  two  tones  would  be  reversed  so 
chat  che  probability  of  the  high  tone  became  -66.  This  sequence  of  reversals  continued  over  a  long 
series  so  that  when  viewed  over  the  entire  series  the  probabilities  of  the  two  stimuli  were  essentially 
equal  There  were  two  experimental  conditions.  The  subjects  were  first  run  without  knowing  that  the 
probabilities  were  changing.  They  were  told  to  count  high  (or  low)  tones  as  they  do  in  all  other  odd¬ 
ball  paradigms.  In  the  second  condition  the  subjects  were  told  that  there  was  a  continuing  sequence 
of  reversals  in  the  probability  of  the  two  tones  and  they  were  assigned  the  task  of  detecting  when  the 
probabilities  actually  changed.  The  aspect  of  the  result  relevant  to  the  present  discussion  concerns 
the  behavior  of  the  P30Q  during  the  "transition''  periods  which  were  defined  as  the  segment  of  the 
series  between  the  rime  the  computer  actually  adopted  the  reversed  probability  and  the  rime  the  sub¬ 
ject  detected  the  probability  shift 

As  Johnson  and  Donchin  used  identical  stimulus  sequences  in  the  two  conditions  it  was  possible  to 
compare  the  P300  elicited  when  the  subjects  knew  about  the  reversals  and  when  they  did  not  When 
the  P300  is  elicited  on  the  five  trials  immediately  preceding  the  subject’s  report  of  the  change,  it 
turns  out  that  during  the  unwitting  sequences  the  amplitude  of  the  P300  is  determined  entirely  by  the 
immediately  preceding  sequence  of  stimuli.  On  the  other  hand,  when  (he  subject  was  trying  to  iden¬ 
tify  the  Him  at  which  the  operating  parameters  of  the  environment  changed,  the  amplitude  of  P300 
in  those  five  trials  increases  gradually  from  trial  to  trial,  reaching  a  peak  just  prior  to  the  subject’s 
report  that  the  probability  has  changed.  Immediately  after  this  report  the  enhancement  of  P300 
disappears  and  the  amplitude  is  determined  entirely  by  the  global  probability  and  the  sequential 
dependencies  as  reported  by  Squires  et  al  (1976).  In  other  words,  as  the  stimulus  delivers  informa¬ 
tion  that  is  usable  in  the  subject’s  structuring  of  the  task  relevant  portion  of  the  environment,  the 
P300  is  enhanced.  As  the  evidence  accumulates  and  the  pressure  for  accepting  the  change  increases. 
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so  does  the  amplitude  of  die  P300,  reaching  a  peak  just  at  die  time  the  change  ia  accepted  and  the 
subject  is  made  sufficiently  aware  of  the  change  to  report  it. 

In  nummary  the  context  updating  model  asserts  nothing  more  than  that  the  P300  is  elicited  by 
processes  with  the  maintenance  of  our  model  of  the  context  of  the  environment.  The 

association  of  the  P300  with  novelty,  its  sensitivity  to  the  decaying  strength  of  representations,  its 
dependence  on  a  limited  capacity  system  and  its  strong  dependence  on  the  relevance  of  the  eliciting 
event  to  the  task  lead  to  the  suggestion  dm  die  processing  it  manifests  is  invoked  whenever  there  is 
a  need  to  revise  the  organism's  model  of  the  context.  We  are  assuming  that  the  larger  the  amplitude 
of  the  P300.  the  larger  the  change  in  the  model.  This  statement  of  our  model  is  admittedly  not  pre¬ 
cise  enough.  We  have  no  commitments  about  the  nature  of  the  system  that  implements  the  contex¬ 
tual  model  or  the  process  by  which  the  context  updating  is  implemented.  However,  we  think  it  is 
premature  to  offer  an  answer  to  these  questions.  We  need  to  obtain  much  more  evidence  on  the  P3Q0 
and  its  relation  to  context  updating  before  we  can  begin  to  discuss  such  details.  However,  even  at 
this  sage  the  context  updating  model  is  powerful  enough  to  generate  interesting  experimental  pred¬ 
ictions  about  the  consequences  of  the  P300. 


4.2.  L 

In  the  next  secdoa  we  will  examine  some  of  these  predictions.  First,  however,  we  must  examine 
Verieger's  description  of  the  context  updating  model  Two  versions  of  the  model  are  subjected  to 
Verleger's  critique.  The  first,  the  "Update"  version,  is  arrived  at  by  a  tortuous  route  which  leads  Vcr- 
leger  from  a  statement  of  the  model  as  presented  above  to  an  insistence  that  we  specify  "what 
exactly  was  meant  by  ’ expectancies. '  'strategies,'  'schema.'  'model'  and  'context'  (p.  4)"  The 
definitions  are  given  whenever  we  conduct  an  empirical  investigation.  For  example,  "expectancy" 
was  operationally  defined  by  Squires  et  al  (1976).  Ia  any  event,  Verleger  proceeds  to  use  out-of- 
context  remarks  about  working  memory  and  representations  to  produce  the  following  version  of  our 
model 

"Each  stimulus  has  its  representation  ('trace')  in  working  memory,  which  decays  over  time. 

After  stimulus  evaluation,  the  trace  of  the  presented  stimulus  is  updated  to  its  optimal  upper- 

limit  value  and  it  is  this  updating  that  is  reflected  by  P3.  p.  5" 

Verleger  is  clearly  attributing  much  more  specificity  to  us  than  we  feel  competent  to  provide.  We 
have  definitely  not  described  a  direct  association  between  P300  and  any  specific  operation  involved 
in  the  maintenance  of  the  contextual  model.  We  certainly  would  not  claim  to  know  that  P300  is  a 
manifestation  of  the  updating  of  traces.  In  the  present  state  of  our  knowledge  we  cannot  go  beyond 
the  more  general  suggestion  that  P300  is  invoked  because  of  the  need  to  update  the  context.  The 
narrow  definition  according  to  which  Verleger  focuses  on  the  very  set  of  trace  updating  leads  him  to 
see  inconsistencies  where  none  exist  Thus,  for  example,  the  flexibility  of  the  sequential  effects  is 
only  a  problem  if  one  assumes  that  P300  is  uniquely  related  to  trace  updating.  This  poses  no  prob¬ 
lem.  however,  if  the  need  to  update  the  context  arises  in  some  circumstances  and  not  in  others. 
Surely  Verleger's  discovery  that  "The  largest  P3s  were  evoked  by  the  last  relevant  item  of  every 
trial"  is  precisely  what  has  been  reported  by  Doochin  and  Cohen  (1967),  or  by  Rohrbaugh.  Donchin 
and  Erikscn  (1974).  We  suggest  that  these  evens  are  the  ones  impelling  the  system  to  update  the 
model  of  the  coo  text. 

A  somewhat  closer  approximation  to  our  views  is  given  by  Verleger  when  he  discusses  his  "Expect" 
version  of  context  updating.  He  acknowledges  that  the  model  as  presented  by  Donchin  (1981)  "goes 
beyond  Update’s  interpretation  of  expectancies  being  representations  of  stimuli  in  working 
memory."  He  sees  that  we  are  suggesting  "strategies  and  expectancies  may  involve  working 
memory,  but  they  do  not  hold  any  privileged  relation  to  working  memory.”  However,  he  immedi¬ 
ately  proceeds  to  narrow  the  definition  so  that  it  is  again  a  caricature,  rather  than  a  valid  portrayal  of 
our  model  Verleger's  strategy  as  he  discusses  "Expect"  is  to  suggest  that  subjects  "expect  the 
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frequent  stimulus”  sod  arc  consequently  "surprised”  by  the  rue  stimulus.  First  portraying  the  con¬ 
text  updating  model  as  an  claim  about  the  relationship  between  violations  of  expectancies  and  P300, 
Verleger  proceeds  to  use  the  substantial  body  of  data  suggesting  that  stimuli  need  not  be  entirely 
unexpected  to  counter  his  own  version  of  our  model.  This  is  as  misleading  as  his  treatment  of 
"Update.”  We  have  repeatedly  sated  that  low  probability  is  neither  a  sufficient  nor  a  necessary  con¬ 
dition  for  eliciting  P’300  or  for  controlling  its  amplitude.  It  is  inherent  in  the  muldplicadve  relation¬ 
ship  between  rarity  and  ask  relevance  that  stimuli  with  high  relevance  will  elicit  a  large  P300  even 
when  highly  probable. 

4  J.  Predictions  from  the  Model 

4.3.1.  The  amplitude  of  P300  and  response  bias 

The  most  important  corollary  of  the  context  updating  model  is  that  it  directed  our  attennon  to  the 
study  of  the  possible  effect  of  eliciting  P300  on  subsequent  performance  through  the  restructuring  of 
the  subject's  model  of  the  environment.  This  approach  was  guided  by  the  realization  articulated  ear¬ 
lier  in  this  precommencary  that  theories  about  an  ERP  component  must  be  tested  by  assessing  the 
consequences  of  eliciting  the  component.  If  the  process  manifested  by  the  component  is  involved  in 
the  restructuring  of  the  subject’s  model  of  the  environment  we  ought  to  be  able  to  measure  effects 
that  can  be  attributed  to  such  changes  and  their  magnitude  should  be  proportional  to  the  amplitude 
of  the  P300. 

As  an  illustration,  consider  the  experiment  reported  by  Donchin,  Granon,  Dupree  and  Coles  (in 
press).  In  this  study  names  were  displayed  on  a  screen  monitored  by  the  subjects,  who  were 
instructed  to  press  one  of  two  buttons  in  response  to  names  of  males  and  the  other  button  in  response 
to  names  of  females.  Female  names  appeared  80%  of  the  time  and  subjects,  who  were  urged  to  per¬ 
form  quickly,  were  strongly  biased  to  press  the  button  indicating  a  female  name.  In  general,  subjects 
hardly  ever  erred  in  response  to  the  female  names  whereas  they  often  pressed  the  wrong  button  in 
response  to  a  male  name.  These  erroneous  responses  were  clearly  "fast  guesses.”  The  ERPs  elicited 
by  the  rare  male  names  showed  an  interesting  pattern.  Their  P300  latency  was  about  100  msec 
greater  than  that  of  rare  names  correctly  responded  to.  In  other  words,  a  fast  guess  delayed  the  P300 
by  about  100  msec. 

What  was  the  subject  doing  in  these  100  msec?  Obviously,  no  direct  answer  is  available.  It  is  plausi¬ 
ble,  however,  that  this  extra  period  was  used  to  assess  the  consequences  of  the  error  trial  so  that 
proper  adjustments  could  be  made  to  the  operating  context  to  minimize  such  errors  in  the  future. 
We  reasoned  that  if  the  P300  was,  as  our  model  assumes,  a  manifestation  of  processing  associated 
with  the  adjustment  of  the  operating  context,  then  it  is  reasonable  to  predict  that  the  larger  the  P30Q 
the  larger  the  adjustment.  The  most  likely  adjustment  after  the  recognition  of  an  error  would  be  a 
revision  of  the  biases  so  that  errors  would  be  less  likely  in  the  future.  "If  this  hypothesis  is  correct 
then  the  characteristics  of  the  P300  elicited  on  error  trials  should  predict  variations  in  response  cri¬ 
teria  in  the  following  trials*  (Doochin  et  aL,  in  press). 

We  accoprdinglly  examined  all  the  trials  oo  which  the  rare  stimulus  was  presented  and  on  which  the 
subjects  and  and  sorted  these  according  to  their  response  oo  the  trial  on  which  the  next  rare  name 
was  presented.  There  could  be  pain  in  which  an  error  was  followed  by  an  error  and  pain  in  which 
the  error  was  followed  by  a  correct  response.  The  critical  finding  was  that  the  amplitude  of  the  P'300 
elicited  by  the  first  member  of  the  pair  was  larger  if  the  response  to  the  second  member  was  correct. 
On  the  assumption  that  the  subject  responds  in  error  to  male  names  because  he  is  biased  to  respond 
to  the  more  frequent  female  names,  we  propose  that  a  correct  response  to  the  male  names  implies 
chat  the  subject  has  adjusted  the  bias  as  a  consequence  of  processing  associated  with  the  error.  It  is 
not  implausible  to  suggest  that  the  amplitude  of  the  P300  is  a  manifestadoa  of  a  process  associated 
with  this  adjustment.  This  interpretation  received  further  confirmation  from  the  observation  that  the 


RT  associated  with  the  ant  female  name  after  a  male  name  is  inversely  proportional  to  the  amp  11* 
aide  of  the  P300  elicited  by  that  male  name. 

Note  that  we  are  not  suggesting  that  the  P300  is  a  manifestation  of  the  bias  adjustment  per  se.  The 
precise  relationship  between  bias  adjustment  and  the  process  manifested  by  the  P300  is  undeter¬ 
mined  at  this  point  All  we  are  claiming  is  that  in  the  complex  of  processes  triggered  by  the  recogni¬ 
tion  that  an  error  has  been  committed  (which  appears  to  delay  the  elicitation  of  the  P300),  an  opera- 
non  whose  effectiveness  is  correlated  with  the  amplitude  of  the  P300  takes  place  and  this  operation 
predicts  rather  reliably  whether  or  not  the  subject  will  commit  yet  another  error. 

4 Perfect  Pitch  and  P300 

.Another  prediction  derived  from  our  model  was  tested  by  Klein  et  al  (1984).  We  predicted  that  sub¬ 
jects  with  perfect  (or  "absolute ')  pitch  would  not  display  a  P’300  when  tested  with  an  auditory  odd¬ 
ball  senes  but  would  show  a  P300  when  tested  with  a  visual  one.  The  results  confirmed  our  predic¬ 
tion.  We  did  not  invent  the  idea  that  such  subjects  maintain  permanent  templates  in  their  memory 
for  acoustic  stimuli.  This  happens  to  be  a  common  account  of  the  Perfect  Pitch  phenomenon.  As 
Klein  et  al  noted  : 

The  weight  of  the  evidence  suggests  that  Individuals  with  Absolute  Pitch  (AP)  have  access  to 
a  set  of  internal  'standards'  that  allows  them  to  fetch  the  name  of  a  tone  without  comparing  the 
representation  of  the  tone  to  a  recently  fetched  representation  of  a  standard  (Siegel  1974).’ 

Lockhead  and  Byrd  (1981;  see  also  Hamad  1987)  review  the  relevant  literature  and  describe  a 
method  for  assessing  the  degree  to  which  an  individual  does  indeed  have  this  skill.  We  arrived  at  the 
prediction  that  AP  subjects  would  not  use  the  mechanism  manifested  by  the  P300  by  combining  this 
common  account  of  AP  skill  with  our  current  view  of  P300.  We  proceeded  to  test  music  students, 
wuh  and  without  AP.  and  the  data  clearly  confirmed  the  prediction. 

As  we  noted  earlier.  Verteger  overlooks  the  fact  that  all  subjects  were  music  students  and  that  the 
amplitude  of  P30Q  was  proportional  to  each  subject’s  actual  skill  as  an  AP  percei ver  rather  than  to 
the  subjects’  reports  about  his  skill.  Verleger  also  astern: 

"If  tones  have  permanently  resident  representations  for  AP  subjects,  why  not  other  well-known 
stimuli  such  as  colon,  the  words  ‘push’  and  ’wait.’  synonyms  and  names?  Since  these  words 
evoke  P3s  Klein  et  al  assumption  is  improbable,  p.  8" 

Verleger  seems  unwilling  to  accept  the  overwhelming  evidence  that  people  with  AP  have  a  rather 
unique  ability  that  cannot  be  explained  simply  by  assuming  that  they  have  over! earned  a  specific  set 
of  well-known  stimuli.  As  we  tested  our  subjects  in  the  way  Lockhead  and  Byrd  did  the  differences 
struck  us  with  remarkable  force.  The  subjects  without  AP  were  all  highly  qualified,  advanced,  music 
students.  They  had  excellent  relative  pitch  judgment.  That  is,  if  allowed  to  hear  a  standard  tone  and 
given  its  name,  they  could  proceed  to  name  any  ocher  tone  with  accuracy.  However,  without  such 
prior  exposure  to  the  standard,  subjects  with  relative  pitch  could  only  name  correctly  a  small  percen¬ 
tage  of  the  tones.  Subjects  with  AP  named  between  82%  and  94%  of  the  tooes  without  access  to  a 
reference  tone.  It  is  inappropriate  to  compare  the  way  AP  subjects  process  tones  to  the  way  we  pro¬ 
cess  other  "well-known''  stimuli,  The  evidence  suggests  that  the  AP  skill  relies  on  a  unique  com¬ 
parison  with  a  permanently  available  standard.  Our  data  show  that  these  subjects,  as  predicted  by  the 
context  updating  model,  draw  on  the  processes  underlying  (he  P300  to  a  significantly  smaller  extent 
that  they  do  when  processing  visual  stimuli, 

We  note  in  passing  that  in  the  course  of  discussing  the  empirical  data  Verleger  makes  a  number  of 
statements  that  suggest  he  may  have  a  rather  limited  model  of  the  cognitive  system.  For  example,  in 
his  discussion  of  Klein  et  al’s  paper,  he  wonders  why  the  subjects  appear  to  make  the  visual  com¬ 
parisons  by  relying  on  a  retrieved  standard.  He  than  writes: 
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'If  Klein  e:  ai's  subjects  were  unable  to  form  the  abstract  categories  ’H*  and  ’S'  but  had  to 
store  the  specific  physical  features,  how  can  other  subjects  combine  stimuli  into  targets  and 
nontargets  irrespective  of  physical  features?  Since  this  is  obviously  possible,  the  assumption  is 
also  improbable,  p.  8" 

Here  again  Verleger  does  not  quote  us  accurately.  We  did  not  say  that  subjects  were  unable  to  form 
the  categories  ’H’  and  ’S.’  Rather,  we  suggested  that  when  a  displayed  character  is  assigned  to  one 
or  another  category  an  encoding  process  must  precede  the  categorization  and  must  use  a  feature 
detection  scheme  that  takes  into  account  the  large  variety  of  fonts  and  shapes  in  which  the  charac¬ 
ters  may  be  presented.  That  such  a  feature-detection  stage,  possibly  preattendve,  precedes  the 
higher-level  categorizations  has  been  demonstrated  rather  persuasively.  For  a  comprehensive  review 
of  much  of  the  relevant  data  the  reader  is  referred  to  Treisman  (1986).  It  is  also  becoming  increas¬ 
ingly  clear  that  much  of  the  data  on  categorization  can  be  interpreted  without  assuming  that  subjects 
form  "abstract  categories"  represented  by  prototypes;  they  may  be  using  an  exemplar  based  scheme 
that  is  very  dependent  on  feature  encoding  schemes  (Medin  and  Wattenmaker  1987;  see  Hamad 
1987  for  further  kinds  of  representational  model).  Thus,  it  seems  the  cognidve  assumptions  of  Klein 
et  al  are  consistent  with  at  least  some  current  theories  of  categorization. 

Equally  strange  is  Verleger' s  nodon  of  Working  Memory.  We  quote  in  full,  so  the  reader  can  assess 
Verleger' s  line  of  thinking: 

"Working  Memory  (WM)  contains  anything  that  has  happened  during  the  previous,  say,  ten 
seconds.  Then  WM  will  be  relatively  empty  when  a  trial  starts.  Storing  the  first  stimulus  in 
WM  will  change  WM  content  from  zero  to  one,  storing  the  second  stimulus  will  change  its 
content  from  one  to  two,  storing  the  last  word  of  a  seven  word  sentence  will  change  its  content 
from  six  to  seven,  p.  10" 

Verleger  appears  to  view  WM  as  a  simple  push  down  stack  which  can  be  "relatively  empty"  at  any 
time  and  into  which  simple  representations  of  stimuli  are  pushed,  and  from  which  they  are  popped, 
as  they  are  presented  and  used.  This  is,  of  course,  a  rather  extreme  simplification  of  the  WM  model 
of  those  who  have  been  using  the  concept.  There  is  almost  no  relation  between  Verleger’s  WM  and 
the  dynamic  and  complex  system  proposed,  for  example,  by  Baddeley  (1987),  in  which  executive 
controllers  oversee  and  integrate  the  activity  of  articulatory  loops  and  visual  scratch  pads,  all  the 
while  monitoring  the  environment  for  relevant  inputs.  Our  own  view  of  the  system  was  summarized 
by  Donchin  ( 1 98 1 )  as  follows: 

"The  schema  may  be  conceptualized  as  a  large  and  complex  map  representing  all  available 
data  about  the  environment.  It  is  the  reservoir  of  information  that  is  necessary  for  performing 
whatever  tasks  require  active  processing  at  any  time.  Some  of  the  information  has  just  been 
delivered  and  may  still  reside  in  various  stages  of  dynamic  memory.  Other  information  resides 
in  longer  term  memory  and  may  need  to  be  rna^<*-  available,  on  a  temporary  basis,  for  integra¬ 
tion  into  the  overall  schema.  The  system  is  quite  fluid.  There  must  be  some  priority  weighting 
system  that  is  associated  with  the  schema  and  determines  which  of  its  aspects  is  related  to 
which  tasks.  Representations  decay  because  of  misuse  or  because  of  shifting  strategies  and 
tasks.  New  mformanon  is  brought  in.  Choices  are  in  the  process  of  using  fhi<  schema. 
When  there  is  need,  the  model  is  revised  by  building  novel  representations  through  the  incor¬ 
poration  of  incoming  data  into  the  schema  based  on  long  term  memory  dvn  It  is  likely  that  it 
is  this  updating  process  that  we  see  manifested  by  the  P300.  p.  508." 

4J  J.  P300  and  the  von  Restorff  effect 

It  is  this  more  dynamic  view  of  ’working  memory’  that  we  espouse  as  we  consider  the  functional 
explanation  of  the  patterns  of  P300  data  and  it  is  this  approach  that  guided  the  last  set  of  predictions 
about  the  P300  we  will  discuss.  We  refer  to  studies  of  the  relation  between  the  P30G  amplitude  eli¬ 
cited  by  stimuli  and  the  probability  that  those  stimuli  will  be  recalled.  Donchin  (1981)  noted  that 
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there  may  be  a  relation  between  theories  of  memory  predicting  that  distinct  items  will  be  best 
remembered  and  the  fact  the  P300  is  elicited  by  distinctive  events.  It  accordingly  teemed 
promising  to  examine  the  extent  to  which  the  amplitude  of  P300  —  interpreted  either  in  this  frame- 
work,  as  a  measure  of  distinctiveness,  or,  altemadvely,  ss  a  measure  of  the  restructuring  of  the 
current  model  of  the  environment  —  might  predict  the  probability  that  items  would  be  remembered. 

The  paradigm  that  proved  particularly  effective  for  testing  this  prediction  was  devised  by  von  Res* 
torff  (1933).  She  presented  subjects  with  a  series  of  items,  some  distinct  in  some  respect  These  "iso¬ 
lates. '  she  reported,  were  remembered  far  better  than  the  other  items  in  the  series.  Kahs  et  al  (1984) 
reasoned  that  these  isolates  would,  by  virtue  of  their  very  distinctiveness.  elicit  a  P3Q0.  They  further 
reasoned  that  the  amplitude  of  these  P300s  would  vary  and  that  this  variance  would  be  correlated 
with  the  subjects’  ability  to  recall  the  isolates. 

To  test  this  prediction  Karts  et  al  presented  each  subject  with  40  series  of  IS  words  each.  In  30  of 
these  lists  one  of  the  words,  in  one  of  the  middle  positions,  was  displayed  in  characters  that  were 
either  larger  or  smaller  than  the  characters  used  for  the  other  items  in  the  list  The  subjects  were 
asked  to  recall  the  items  immediately  after  each  list  was  presented,  as  well  as  at  the  end  of  the  entire 
senes  of  presentations.  At  a  later  time,  the  subjects'  ability  to  recognize  the  previously  presented 
words  was  also  tested.  Avenge  ERPs  were  obtained  separately  for  the  recalled  and  unrccalled 
words.  The  pattern  of  results  was  rather  complex.  For  present  purposes  it  is  sufficient  to  note  that  the 
amplitude  of  the  P300  was  indeed  larger  for  recalled  items.  However,  this  was  in  the  main  true  only 
for  those  subjects  who  used  rote  memorization  to  aid  their  recall.  The  relation  between  P’300  and 
recall  did  not  hold  for  subjects  who  used  elaboration  techniques  as  a  tool  to  aid  their  recall. 

The  interaction  between  recall  strategies  on  the  one  hand  and  the  P3 00/recall  relationship  on  the 
other  was  confirmed  in  a  subsequent  study  by  Fabiani  et  al  (198 6),  who  replicated  the  Karis  et  al 
study,  except  that  this  time  the  subjects  were  explicitly  instructed  to  use  one  strategy  or  the  other. 
All  subjects  were  instructed  (in  a  properly  counterbalanced  way)  to  use  all  strategies.  It  was  found 
that  the  aforementioned  interaction  between  P300,  recall  strategies  and  recall  holds  in  subjects.  In 
another  experiment.  Fabiani,  Karis  and  Donchin  (1986)  used  an  incidental  learning  paradigm  in 
which  subjects  had  no  reason  to  use  elaborative  strategies.  For  all  subjects  the  amplitude  of  the  P300 
elicited  by  subsequently  recalled  items  was  larger  than  the  amplitude  elicited  by  items  that  were  not 
recalled.  Filler,  Kutas  and  Mayes  (1987)  report  similar  results  using  a  "level  of  processing"  pan* 
digm.  Confirming  findings  were  also  reported  by  Fox  and  Michie  (1987). 

To  what  extent  do  these  data  provide  support  for  the  context  updating  model?  If  the  model  is  con¬ 
strued  as  an  claim  about  the  class  of  functions  the  P300  is  most  likely  to  manifest,  then  the  strong 
relationship  between  P300  amplitude  and  subsequent  recall  is  certainly  consistent  with  the  sugges¬ 
tion  the  process  underlying  the  P300  is  involved  in  the  creation  and  maintenance  of  representa¬ 
tions.  In  its  simplest  form  this  assertion  anchors  the  P300  in  the  family  of  processes  that  make  "dis¬ 
tinctiveness"  so  powerful  a  factor  in  determining  recall  and  recognition  of  items-  Even  more  impor¬ 
tant  is  the  interaction  between  P300  and  the  subject’s  recall  strategies.  Use  importance  of  this  mult 
is  that  it  places  constraints  on  the  range  of  functions  that  might  be  associated  with  the  P300.  When 
stripped  of  the  necessary  complexities,  the  fact  that  the  effect  we  observe  is  restricted  to  subjects 
who  use  tote  memory  suggests  that  the  interaction  in  which  the  P300  process  plays  a  role  must  take 
place  at  one  of  the  earlier  levels  of  stimulus  encoding.  It  is  the  representation  of  the  actual  physical 
stimulus,  rather  than  some  subsequently  encoded  representation,  that  is  affected  by  the  intensity 
with  which  the  process  wwiifowd  by  the  P300  Is  activated.  Fabiani  et  al  (in  preparation)  tested  a 
corollary  of  this  interpretation.  They  assessed  the  degree  to  which  subjects  actually  recall  the  sixe  of 
the  original  stimuli  presented  during  the  study  period.  It  turns  out  that  when  subjects  use  rote 
memory  they  are  more  likely  to  recall  the  original  size  of  the  word  than  are  the  very  same  subjects 
when  they  "elaborate."  The  implication  is  clear  -  the  P300's  effect  is  exercised  at  the  early  represen¬ 
tational  sage  of  stimulus  encoding. 


The  details  of  the  process  need  further  elaboration.  Here,  as  in  previous  discussions,  we  do  not  find 
it  necessary  to  push  the  theoredcal  analysis  beyond  the  depth  needed  to  move  ahead  with  our  experi- 
mental  program.  Indeed,  we  view  theory  largely  as  a  guide,  a  very  rough  guide,  to  our  experimental 
program.  Rather  than  propose  all-encompassing  systems  as  accounts  of  the  P300,  systems  in  which 
every  item  is  fully  defined  and  every  operation  entirely  predictable,  we  prefer  theories  that  at  any 
stage  of  the  research  are  sufficiently  precise  to  allow  the  next  stage  of  the  program  to  establish  itself. 

5.  Summary 

In  conclusion,  we  remain  persuaded  that  our  approach  to  the  study  of  the  endogenous  components  of 
the  ERP  is  sound.  Our  goal  is  to  understand  the  functional  significance  of  these  components.  In  this 
precommentary  we  focused  on  the  P300  because  Verleger’s  critique  was  concerned  with  our  views 
of  this  component.  However,  our  remarks  are  more  general  in  their  intent.  We  believe  that  accounts 
of  these  phenomena  should  take  the  form  of  hypotheses  about  the  informadon  processing  functions 
of  the  aedvides  manifested  by  the  component.  These  should  be  derived  from  a  knowledge  of  the 
antecedent  conditions  of  the  components;  however,  an  enumeration  of  these  conditions,  no  m»rn»r 
how  general,  does  not  in  itself  constitute  a  theory  of  the  component  Once  a  theory  is  proposed,  it 
should  be  assessed,  not  by  arguments  about  the  plausibility  of  post  hoc  explanations,  but  by  predic¬ 
tions  about  the  consequences  of  the  component 

Verleger’s  two  versions  of  our  model  do  not  represent  our  views  adequately  Nrniy  they  are  ha  sod 
on  a  very  narrow  and  simplistic  construal  of  terms  such  as  "expectancy"  and  "working  memory." 
The  theory  Verleger  proposes  is  not  a  theory  but  rather  a  generalization  about  the  antecedent  condi¬ 
tions  of  the  P300.  The  target  article  also  ignores  rather  important  segments  of  the  literature  that  are 
crucial  to  an  understanding  of  the  P300,  and  those  studies  that  are  Hio-my/i  are  mostly 
misrepresented  and  misinterpreted. 

In  this  precommentary  we  present  the  context  updating  model  as  we  actually  construe  it  and  we 
review  some  of  the  studies  that  have  been  generated  by  our  version  of  the  model.  It  will  be  seen  char 
whereas  the  model  is  indeed  vague  about  the  specific  mechanisms  manifested  by  the  P300,  it  is 
specific  with  respect  to  the  classes  of  functions  likely  to  be  manifested  by  the  P300.  More  important, 
the  model  has  served  as  a  tool,  guiding  our  empirical  investigations  into  a  number  of  exciting 
research  avenues  and  leading  to  a  number  of  interesting  discoveries.  This,  to  us,  is  the  ultimate  test 
of  a  successful  model. 
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Introduction 

A  persistent  theme  in  Lindsley' s  writings  has  been  the  focus  on  the 
temporal  character! sties  of  psychophysiol ogical  signals.  His  second 
published  paper  (Herren  4  Lindsley,  1931)  was  concerned  with  the  latencies 
of  tendon  reflexes.  This  interest  in  latencies  continually  appears  in 
subsequent  studies  (e.g..  Chalupa,  Rohrbaugh.  Gould,  4  Lindsley.  1974; 

Oonchin  4  Lindsley.  1965;  Lindsley.  1944;  Lindsley,  1954;  Lindsley.  1982; 
Lindsley  4  Drmons.  1958;  Lindsley.  Fehmi .  4  Adkins.  1967;  Lindsley  4 
Rubenstein,  1937;  Lindsley.  Seales,  4  Wilson,  1973;  Schwartz  4  Lindsley. 
1964).  This  chronometric  approach,  that  has  played  a  critical  role  in  many 
areas  of  Psychology  (Posner,  1978),  makes  the  seemingly  paradoxical 
assumption  that  even  i f  we  do  not  know  what  happened  it  is  useful  to  know 
when  it  happened.  The  relative  timing  of  events,  the  rhythm  with  which  they 
occur  and  the  factors  that  increase,  or  decrease,  the  speed  with  which  the 
events  are  triggered,  can  be  useful  in  the  analysis  of  a  system  even  if  we 
are  not  quite  ready  to  provide  a  full  account  of  the  processes  that  underlie 
the  events.  It  is  clear,  for  example,  that  when  Shakespeare,  in  the 
Sixteenth  Century,  has  Leontes  in  The  Winter's  Tale  (Act  I  sc.  2)  say:  "I 
have  a  tremor  cordis  on  me:  my  heart  dances;  But  not  for  joy;  not  joy...." 
he  is  expressing  a  recognition  of  a  common  relationship  between  an 
accelerated  heart  rate  and  "joy.*  It  is  also  evident  that  Shakespeare  is 
aware  that  even  though  the  dancing  heart  is  commonly  an  Indicator  of  "joy" 
this  relationship  can  break  down.  The  accuracy  and  utility  of  this 
"psychophysiological"  implication  of  Shakespeare's  does  not  depend  on  his 
knowing,  as  he  obviously  does  not,  the  physiology  of  the  heart  and  of  the 
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control  of  the  heart's  rhythms.  The  validity  of  such  psychophysiological 
observations  depends  on  the  careful  observation  of  the  temporal 
characteri sties  of  the  physiological  events  (“my  heart  dances"),  on  the 
proper  definition  of  the  psychological  concepts  ("not  joy"),  and  on  the 
theory  within  whose  framework  these  relationships  play  a  role. 

In  his  work  on  the  Excitability  Cycles,  in  studies  of  backward  masking 
and  in  the  examination  of  the  various  rhythms  of  the  EEG.  with  particular 
emphasis  on  the  alpha  rhythm,  Lindsley  has  shown  how  it  is  possible  to  use 
the  timing  of  psychophysiological  signals  in  the  study  of  the  mind  even  if 
the  nature  of  the  processes  observed  by  the  psychophysiologist  are  not 
fully,  or  even  partially,  understood.  The  point  is  simple.  A  well  defined 
psychophysiological  response  is  an  event.  Once  we  have  identified  the 
occurrence  of  an  event,  we  are  in  possession  of  information  about  its  time 
of  occurrence,  even  if  we  do  not  know  its  causes  or  its  nature.  These 
temporal  data  can  serve  as  dependent  variables  in  our  studies.  If  our 
theories  generate  differential  predictions  about  the  variation  of  these 
temporal  variables  as  a  function  of  properly  selected  independent  variables, 
these  data  can  play  a  critical  role  in  the  understanding  of  the  mind. 

This  point  of  view  is  illustrated  by  the  widespread  use  of  the  latency 
of  components  of  the  Event-Related  Brain  Potentials  (ERP).  Thus,  for 
example,  Hillyard  and  his  associates  demonstrated  that  a  differentiation 
between  the  ERPs  elicited  by  attended  and  Ignored  stimuli  appears  as  early 
as  the  first  100  ms  following  the  eliciting  stimuli  (Hillyard.  1984).  The 
observation  of  Hillyard' s  that  makes  a  difference  for  a  theory  of  Attention 
is  the  observation  about  the  timing  of  the  ERP  component,  and  this 
theoretical  point  is  largely  unaffected  by  debates  about  the  precise  nature 
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of  the  ERP  components  observed  (Naatanen,  1982).  Another  ERP  component 
whose  latency  has  proven  a  useful  tool  in  the  analysis  of  cognitive  function 
is  P300.  In  this  chapter  we  illustrate  the  richness  of  the  information  the 
latency  of  the  P300  yields  regarding  processes  that  are  essentially  opaque 
to  the  more  traditional  tools  of  cognitive  psychology.  We  will,  for  the 
purposes  of  this  illustration,  provide  a  partial  description  of  a  study  that 
will  be  reported  in  full  elsewhere  {Grattan,  Coles.  Dupree  and  Donchin, 
1985).  The  analysis  of  P300  timing  will  be  made,  in  this  chapter,  from  a 
perspective  of  a  theory  of  P300  that  makes  specific  predictions  regarding 
the  consequences  of  changes  in  an  internal  process,  changes  that  are 
manifested  at  the  scalp  by  changes  in  the  amplitude  of  the  P300  (Donchin. 
1981).  As  the  predictions  are  confirmed  by  the  data,  the  study  lends 
support  to  our  interpretation  of  the  P30Q. 

The  study  we  discuss  is  one  in  a  series  employing  the  "Oddball " 
paradigm  in  which  the  stimuli  are  names  of  individuals  commonly  used  in  the 
American  culture.  In  all  cases  the  series  are  constructed  so  that  20%  of 
the  names  were  names  of  males  (e.g.,  David.  Henry.  Thomas...).  The  other 
names  used  in  the  series  are  commonly  associated  with  females  (e.g.,  Nancy. 
Helen,  Susan...).  On  some  occasions,  the  subject  is  required  to  count  the 
number  of  names  that  fell  in  one  or  another  category  (a  count  condition). 

On  other  occasions  the  subject  indicates  the  occurrence  of  one  of  the 
categories  by  pressing  one  of  two  buttons  (a  Reaction  Time,  or  RT. 
condl ti on) . 

The  initial  study  in  this  series  was  reported  by  Kutas.  McCarthy  and 
Donchin  (1977).  Their  subjects  were  presented  with  3  different  Oddball 
series.  A  "Variable  Names"  series  was  constructed  from  names  of  males  and 
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females  as  described  in  the  previous  paragraph.  A  “Fixed  Names"  series 
included  just  the  names  “David"  and  "Nancy."  The  third  series  was  a 
sequence  of  words.  20%  of  which  were  synonyms  of  “prod."  In  the  latter 
case,  the  subject's  task  was  to  press  one  button  in  response  to  such 
synonyms  and  to  press  another  button  in  response  to  all  other  words.  The 
rare  events  in  each  series  elicited  a  large  P300.  This  was  true  regardless 
of  tne  specific  task  assigned  to  the  subject. 

It  turned  out  that  the  latency  of  the  P300  varied  across  the  three 
conditions.  This  was  particularly  evident  when  the  subjects  were  instructed 
to  be  accurate.  The  shortest  latency  was  observed  when  the  subject 
discriminated  between  the  two  names.  David  and  Nancy.  A  longer  latency  was 
seen  when  the  names  varied  from  trial  to  trial.  The  longest  latency  was 
associated  with  the  need  to  decide  whether  each  of  a  rather  disparate  list 
of  words  is  a  synonym  of  "prod."  These,  and  a  considerable  amount  of 
additional  data,  lead  us  to  suggest  that  the  latency  of  the  P3Q0  depends  on 
the  time  required  for  the  evaluation  of  the  stimulus  and  is  independent  of 
response  selection.  Subsequent  work  (McCarthy  4  Donchin.  1981)  provided 
strong  support  to  the  assertion  that  the  latency  of  P300  is  largely 
independent  of  the  duration  of  processes  that  are  involved  in  the  execution 
of  the  response.  The  Interesting  conclusion  from  the  data  reported  by  Kutas 
et  al .  (1977)  has  been  that  the  latency  of  P300  is  proportional  to  the  time 
it  takes  to  categorize  the  stimuli.  If  this  Is  the  case,  the  P300  latency 
may  be  used  as  a  tool  In  mental  chronometry  to  measure  mental  timing 
uncontaminated  by  "motor"  processes  (Coles,  Gratton,  Bashore,  Eriksen,  4 
Donchin.  1985;  Donchin,  1981;  Magliero,  Bashore.  Coles.  4  Donchin,  1984; 
McCarthy  4  Donchin,  1981).  For  studies  in  which  P300  latency  is  indeed 
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utilized  in  this  fashion  see  Duncan-Johnson  and  Donchin  (1982),  Pfefferbaum, 
Ford.  Roth,  and  Kopell  (1980),  Goodin,  Squires,  and  Starr  (1978),  and 
Pfefferbaum,  Ford,  Roth,  and  Kopell  (1980)*. 

Kutas  et  al .  (1977)  examined  the  relationship  between  the  latency  of 
P300  and  the  RT  associated  with  each  of  the  trials  in  an  oddball  study  using 
names,  sorted  according  to  gender  (see  also  McCarthy,  Kutas.  &  Donchin. 

1979).  Their  analysis  capitalized  on  a  filtering  technique  that  allowed  the 
measurement  of  the  latency  of  P300  on  individual  trials  (Woody.  1967).  The 
principal  finding  was  that  the  correlation  betv*en  P300  latency  and  RT 
depends  on  the  strategy  adopted  by  the  subjects.  When  the  subjects  were 
instructed  to  be  accurate  the  correlation  between  P300  latency  and  RT  was 
larger  (.61)  than  it  was  when  they  were  instructed  to  be  fast  (.48).  These 
data  support  the  suggestion  (Donchin,  1979;  1981)  that  the  P300  and  the 
motor  response  may  each  be  the  product  of  a  series  of  processing  activities 
and  that  these  streams  of  processing  can,  in  principle,  be  quite  independent 
of  each  other. 

Since  the  invocation  of  P3QQ  is  dependent  on  the  evaluation  of 
information  conveyed  by  the  stimulus,  the  latency  of  P300  must  be  at  least 
as  long  as  the  duration  of  these  evaluative  processes.  The  overt  responses, 
on  the  other  hand,  may  well  be  released  “prematurely"  on  the  basis  of 
limited  information.  The  correlation  between  RT  and  the  latency  of  the  P300 
will  therefore  depend  on  the  degree  to  which  the  overt  responses  that  define 
the  RT  are  made  contingent  on  the  evaluation  of  the  stimulus.  The  more 
inclined  Is  the  subject  to  respond  prematurely,  the  poorer  the  correlation 
between  the  latency  of  the  P3Q0  and  the  RT. 

One  striking  aspect  of  the  data  acquired  by  Kutas  et  al .  (1977)  was 
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observed  in  the  trials  in  which  subjects  made  errors.  These  were  mostly 
trials  on  which  the  subject  responded  to  a  rare  event  as  if  it  were  a 
frequent  event.  That  is,  even  though  a  Male  name  appeared  on  the  screen, 
the  subject  pressed  the  button  associated  with  Female  names.  There  were  but 
a  few  such  trials  in  the  study  reported  by  Kutas  et  al .  (1977).  However,  in 
virtually  all  these  trials,  the  pattern  was  the  same  -  the  RTs  were 
(relatively)  short  and  the  P300  latency  was  (relatively)  long.  It  was  as  if 
on  these  trials  the  subjects  first  acted  and  than  thought!  A  partial  report 
on  these  data  can  be  seen  in  McCarthy  (1984).  In  10  of  the  11  subjects  the 
pattern  obtained  was  identical.  The  incorrect  responses  were  associated 
with  very  short  RTs  and  relatively  long  P300  latencies.  One  of  several 
possible  interpretations  of  these  data  is  that,  when  the  information 
processing  system  detects  an  error,  the  invocation  of  the  P300  is  delayed. 
According  to  this  suggestion,  the  delay  is  required  to  allow  further 
processing  of  the  trial's  data.  While  this  interpretation  is  consistent 
with  the  data,  it  is  not  the  only  possible  account  for  the  increased  latency 
of  the  P300  on  error  trials.  Several  other  possible  mechanisms  need  be 
considered.  Another  interpretation  of  the  data  is  based  on  the  fact  that  on 
all  the  error  trials  the  subject  responded  rather  fast  to  the  stimulus.  In 
other  words,  these  are  clearly  trials  on  which  a  variety  of  factors  are 
injected  into  the  stream  of  processing.  How  do  we  know  that  it  is  the 
recognition  of  the  error,  rather  than  the  fact  that  a  very  fast  response  was 
emitted  on  the  trial,  that  accounts  for  the  delay?  A  different,  but  related 
possibility  Is  that  it  is  not  that  P300  is  delayed  on  error  trials,  but 
rather  that  errors  may  be  more  likely  on  trials  on  which  P300  latency  is 
long.  Finally,  It  can  be  suggested  (see  McCarthy,  1984)  that  the  positive 
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peak  that  is  observed  on  the  error  trials  is  not  a  UP3QQ,“  at  least  not  that 
elicited  by  the  names,  but  is  rather  a  different  component,  perhaps  even  a 
different  "P300,"  which  may  be  elicited  by  the  recognition  of  the  error. 

The  last  interpretation  raises  the  issues  of  how  we  define  "P3Q0." 

There  is  no  doubt  that  one  of  the  major  difficulties  presented  by  ERP  data 
is  associated  with  the  definition  and  the  proper  Identification  of 
components  of  the  ERP.  As  we  noted  above,  if  our  strategy  depends  on  the 
measurement  of  the  timing  of  events  we  must  be  sure  that  the  timing  we 
measure  on  different  occasions  is  indeed  the  timing  of  the  same  event.  The 
approach  calls  for  considerable  care  in  the  definition  of  components  if  the 
features  of  the  waveform  we  measure  can  be  affected,  as  they  no  doubt  are. 
by  different  components  (Donchin  4  Heffley.  1978;  Fabiani,  Gratton.  Karis.  4 
Oonchin,  in  press).  Consider,  for  example,  each  of  the  positive  going  peaks 
observed  by  Kutas  et  al .  (1977).  Each  of  these  positive  peaks  has  been 
labeled  ,,P300‘'  even  though  the  peaks  differ  in  latency  by  as  much  as  100  ms. 
What  leads  us  to  believe  that  these  three  peaks  are  indeed  instances  of  a 
component  whose  latency  is  shifted  by  the  duration  of  the  processing 
preceding  its  invocation?  How  do  we  know  that  the  peaks  with  the  longer 
latencies  are  not  entirely  new  components  that  are  elicited  by  the 
presentation  of  a  word,  or  by  the  search  for  a  synonym?  The  issue  is 
generally  resolved  on  the  basis  of  the  similarity  of  waveshapes,  of  the 
scalp  distribution  of  the  potentials  and  of  the  manner  in  which  they  respond 
to  experimental  manipulations  (Donchin,  Ritter,  4  McCallum,  1978;  Sutton  4 
Ruchkin,  1984). 

To  resolve  some  of  the  doubts  that  remained  regarding  the  ERPs  elicited 
on  error  trials  we  replicated,  and  extended,  the  study  reported  by  Kutas  et 
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al .  (1977).  While  in  structure  the  study  reported  here  followed  closely  the 
"variable  name"  phase  of  the  Kutas  et  al .  (1977)  study,  we  expanded  the 
design  in  a  number  of  ways.  To  assure  that  the  number  of  error  trials  was 
sufficient  to  allow  the  needed  comparisons,  the  sessions  were  greatly 
extended  and  we  recorded  800  trials  in  each  of  the  experimental  conditions. 
As  before,  each  subject  performed  the  task  under  an  "accuracy"  and  under  a 
"speed"  regime.  Furthermore,  in  order  to  determine  the  extent  to  which  the 
observations  depend  on  the  Imbalance  between  the  probabilities  with  which 
the  two  categories  of  names  appear  in  the  series,  the  Speed  and  the  Accuracy 
conditions  were  run  twice.  In  one  case  the  Male  names  were  rare  { P= . 20 ) 
while  in  the  other  case  Male  and  Female  names  appeared  with  equal 
probability.  The  equal  probability  condition  (which  was  not  present  in  the 
Kutas  et  al .  study)  allowed  us  also  to  assess  the  parameters  (RT.  accuracy. 
P300  latency,  etc.)  of  fast  guesses,  where  there  was  no  particular  advantage 
for  either  response. 


Method 

Seven  right-handed  male  students  at  the  University  of  Illinois  were 
paid  $3.50  an  hour  for  their  participation  In  the  study.  The  subject  was 
positioned  In  front  of  a  PLATO  terminal  with  the  fingers  of  each  hand 
resting  on  a  zero  displacement  dynamometer.  Male  and  female  names  were 
presented  on  the  screen  for  200  msec  every  2000  msec,  and  the  subject  was 
required  to  squeeze  one  of  the  dynamometers  following  the  presentation  of  a 
name. 

Subjects  were  shown  the  names  in  blocks  of  100  trials.  Blocks  were 
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composed  of  either  80  female  and  20  male  names  or  50  of  each.  In  different 
trial  blocks,  the  subjects  were  Instructed  either  to  respond  as  quickly  as 
possible  ("speed  instructions"),  or  as  quickly  as  possible  but  without 
making  errors  ("accuracy  instructions") .  Eight-hundred  trials  were  run  for 
each  of  the  probability  x  instruction  conditions,  with  half  the  trials  run 
during  one  session  and  the  remaining  half  run  during  a  second  session. 

The  EEG  was  recorded  using  Burden  electrodes  at  Fz.  Cz.  Pz,  placed 
according  to  the  International  10-20  system  (Jasper,  1958)  and  referred  to 
linked  mastoids.  The  signals  were  amplified  and  filtered  on-line  (8  sec 
time  constant,  and  35  Hz  upper  half  amplitude  cutoff  point).  E0G  was 
recorded  for  purposes  of  subtracting  out  ocular  artifact  from  EEG.  The 
subtraction  of  the  ocular  artifact  was  accomplished  by  means  of  a  procedure 
described  by  Gratton,  Coles,  and  Oonchin  (1983).  The  EEG  and  E0G  signals 
were  digitized  at  100  Hz  for  a  period  of  1400  msec,  starting  100  msec  before 
each  stimulus  presentation. 

Average  ERP  waveforms  were  computed  for  each  instruction,  probability, 
stimulus  category,  subject  and  electrode.  P300  latency  and  amplitude  was 
assessed  on  each  single  trial  according  to  a  procedure  described  in  Gratton 
et  al .  (1985) . 


Results 

The  results  will  be  divided  into  several  sections.  First,  we  will 
present  data  supporting  the  claim  that  the  stimulus  probability  manipulation 
did  indeed  affect  the  subjects'  response  strategy.  Second,  we  will  describe 
the  relationships  between  P300  latency  and  RTs,  and  between  these  two 
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variables  and  response  accuracy.  These  data  replicate  the  results  obtained 
by  Kutas  et  al .  (1977).  Then,  we  will  describe  a  procedure  devised  to 
interpret  these  relationships.  Finally,  we  will  analyze  some  of  the 
consequences  of  the  processes  involved  in  P300  generation.  Note  that  the 
present  chapter  is  but  a  partial  report  of  the  study.  It  is  intended  to 
illustrate  the  chronometric  use  of  psychophysiol ogical  signals  rather  than 
to  serve  as  a  comprehensive  report  of  the  study.  Therefore,  we  shall 
ignore,  in  this  discussion,  the  data  obtained  In  the  accuracy  instruction 
condition.  Furthermore,  we  shall  ignore  many  of  the  detailed  analyses  of 
the  data  that  are  required  to  fully  support  our  interpretations.  For  a  full 
description  of  the  study  see  Gratton  et  al .  (1985).  Note  also  that  some  of 
the  analyses  were  based  on  five  subjects  only,  because  of  the  small  number 
of  error  trials  in  the  frequent  female  condition. 

Effects  of  Mani pul ations  on  Response  Strategy 

The  frequency  and  latency  of  correct  and  incorrect  overt  responses  for 
each  probability  x  stimulus  x  response  condition  are  shown  in  figure  1. 


Insert  Figure  1  About  Here 


The  subjects'  overt  response  was  affected  by  the  probability  of  the 
stimulus.  In  fact  the  response  to  the  rare  male  stimuli  was  less  accurate 
than  that  to  the  frequent  female  stimuli  (under  speed  Instructions),  as 
revealed  by  the  instruction  x  stimulus  x  response  Interaction,  F(l,4)=9.22, 
p< . 05 .  The  latency  of  the  correct  response  for  male  stimuli  was  slower  than 
that  for  female  stimuli,  while  the  latency  of  the  Incorrect  response  was 
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faster,  as  revealed  by  the  stimulus  x  response  Interaction,  F( 1 ,4)=11 .69, 
p<.05.  This  was  particularly  evident  for  the  20/80  condition,  F(l,4)*91.68, 
pc.OOl.  In  particular,  in  the  20/80  condition,  for  male  stimuli  (rare)  the 
incorrect  response  was  faster  than  the  correct  response  by  134  ms,  while  for 
female  stimuli  (frequent)  the  correct  response  was  50  ms  faster  than  the 
incorrect  response. 

These  findings  support  the  conclusion  that  the  subjects  indeed 
conformed  their  response  strategy  to  the  probability  manipulation.  In 
particular,  when  the  female  stimulus  was  presented  more  often  (the  20/80 
condition)  the  subjects  tended  to  execute  fast  female  responses  whatever 
stimulus  was  presented.  In  fact,  the  error  rate  for  male  stimuli  under 
these  conditions  is  64%.  while  the  error  rate  for  female  stimuli  is  only  5%. 

P3Q0 .  RT.  and  Accuracy 

The  grand  average  waveforms  at  Pz,  for  each  probability,  stimulus,  and 
response  condition  are  shown  in  Figure  2. 


Insert  Figure  2  About  Here 


Inspection  of  this  figure  reveals  several  interesting  points.  A  large 
positivity  is  the  most  dominant  feature  of  the  waveforms.  We  interpret  this 
positivity  as  P300.  The  latencies  of  the  P300  peaks  (shown  in  Figure  2)  for 
trials  vrfiere  a  correct  response  was  given  were  60  ms  shorter  than  for  trials 
where  an  incorrect  response  was  given,  F(l,4)*24.87,  p< . 0 1 .  The  category  of 
the  stimulus  did  not  affect  P300  latency,  F(l,4)*-0.66,  p>.05,  nor  did  the 
probability  manipulation,  F( 1 ,4) »1 . 31 ,  p>.05,  or  the  stimulus  x  probability 
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interaction,  F(1.4)*0.00,  p< .05 .  Thus,  the  time  the  subject  takes  to  emit  a 
P3QQ  does  not  depend  on  whether  the  stimulus  is  male  or  female,  or.  in  fact, 
on  whether  the  probability  of  the  stimulus  is  manipulated.  These  results 
contrast  with  those  obtained  for  the  RT.  They  indicate  that  the  timing  of 
those  processes  on  which  the  emission  of  the  P300  depends  is  not  affected  by 
the  variations  in  the  criteria  for  overt  response  emission,  which  were 
introduced  by  the  experimental  manipulations.  However,  the  amplitude  of  the 
P300  (see  Figure  2)  was  affected  by  some  of  these  variables.  In  particular, 
the  male  stimuli,  when  rare,  elicited  a  larger  P30Q  than  the  female  stimuli 
(in  this  case,  frequent).  This  produced  a  significant  stimulus  x  probability 
interaction,  F(1.4)=  15.34,  p< .05 .  On  the  other  hand.  P300  for  incorrect 
responses  was  only  slightly  (and  not  significantly)  larger  than  for  correct 
responses,  F(1.4)=2.99.  p> .05 . 

Summarizing  these  findings,  we  note  that  we  have  replicated  and 
extended  the  Kutas  et  al .  (1977)  study.  Error  trials  are  generally 
associated  with  faster  RTs.  but  later  P3Q0s  than  correct  trials.  The 
stimulus  probability  affects  RT  but  not  P300  latency. 

Error  Recognition 

The  results  presented  above  are  consistent  with  the  interpretation  that 
P300  latency  is  influenced  by  variables  affecting  the  time  required  to 
evaluate  the  stimulus,  but  relatively  independent  of  the  variations  of  the 
response  criteria  adopted  by  the  subject.  However,  the  observation  that  the 
overt  response  is  fast  and  P300  late  on  error  trials  require  some 
explanation.  In  fact,  two  explanations  are  possible.  First,  it  may  be 
that,  in  analyzing  the  error  trials,  we  select  those  trials  in  which  fast 
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guessing  and/or  delay  in  the  evaluation  of  the  stimulus  occur.  Thus,  errors 
may  occur  because  stimulus  evaluation  (i.e.,  P300)  is  late  in  comparison 
with  response  activation  processes.  Second,  it  may  be  that  processing  of 
the  error  may  delay  P300.  In  this  case,  the  latency  P300  would  not  only 
reflect  the  processing  of  the  external  stimulus  (male  or  female  name),  but 
also  the  processing  of  the  internal  events  leading  to  the  response. 

To  choose  between  these  two  Interpretations,  we  focussed  our  attention 
on  a  condition  in  which  errors  are  particularly  frequent  (the  “speed" 
condition).  Our  procedure  was  based  on  an  analysis  of  the  speed/accuracy 
functions  for  this  condition.  In  addition,  we  were  interested  in 
distinguishing  among  trials  with  different  P300  latency.  Speed/accuracy 
functions  for  different  P3Q0  latency  bins  are  presented  in  Figure  3.  for 
20/80  and  50/50  conditions  separately. 


Insert  Figure  3  About  Here 


For  the  20/80  conditions,  the  functions  were  computed  separately  for  male 
and  female  names.  For  the  50/50  condition,  data  from  the  two  name 
categories  were  pooled  together,  since  they  had  the  same  probability.  These 
functions  reveal  several  interesting  points.  First,  the  longer  the  RT  the 
higher  the  accuracy.  Second,  accuracy  is  higher  for  trials  on  which  P300 
latency  Is  relatively  short.  These  findings  suggest  that  accuracy  is 
largely  dependent  on  the  relative  timing  of  P300  latency  and  RT. 

We  also  note  the  very  low  accuracy  when  the  rare  male  names  are 
presented.  This  Is  especially  true  when  RTs  are  fast  and  P30Q  latency  Is 
long.  In  this  case,  we  might  speculate  that  the  subject's  basic  strategy  is 
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to  emit  the  "female"  response  to  the  “male"  stimulus.  In  fact,  virtually  no 
errors  can  be  observed  in  response  to  frequent  female  names.  It  may  be 
possible  to  suggest  that  this  pattern  of  results  is  due  to  the  fact  that 
errors  occur  when  stimulus  evaluation  time  is  for  some  reason  slow  so  that 
the  male  stimulus  is  not  processed  fast  enough  to  inhibit  the  female 
response.  If  this  explanation  is  valid  then  the  delay  in  P300  on  error 
trials  is  not  due  to  the  processing  of  the  consequences  of  the  error. 

However,  a  third  observation  is  not  compatible  with  the  interpretation 
that  P300  is  solely  dependent  on  the  time  required  to  decide  whether  the 
stimulus  was  male  or  female.  In  fact,  if  P300  is  sensitive  only  to  the 
stimulus  categorization  process,  and  a  delay  in  P30Q  indicates  only  a  delay 
in  this  categorization  process,  then  responses  given  before  this  process  is 
sufficiently  established  should  have  a  chance  level  of  being  correct.  In 
the  50/50  condition  this  chance  level  is  .50.  Thus,  this  interpretation 
should  predict  that,  in  the  speed  50/50  condition,  the  error  rate  would 
never  exceed  .5  even  in  cases  of  long  P300  latency  and  short  RTs.  Actually, 
figure  3  reveals  that,  in  this  condition,  the  accuracy  for  trials  with  fast 
RTs  and  slow  P300  latency  is  lower  than  the  chance  level.  This  indicates 
that,  by  looking  at  trials  with  fast  RTs  and  long  P300  latency,  we  are 
"selecting"  error  trials.  We  interpret  this  finding  as  demonstrating  that 
the  association  between  incorrect  response  and  long  P300  latency  is  not  due 
solely  to  the  fact  that  errors  occur  because  of  a  delay  in  stimulus 
evaluation.  We  must  also  propose  that  the  processing  of  the  incorrect 
response  causes  a  delay  in  P30O. 
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The  P3QQ  and  Future  Action 

We  have  demonstrated  that  P300  is  delayed  on  incorrect  trials.  This 
delay  indicates  that,  before  emitting  the  P300,  the  subject  must  have  not 
only  categorized  the  name,  but  also  compared  the  stimulus  category  with  the 
current  response.  Presumably,  the  delay  in  P300  reflects  some  process  that 
occurs  when  the  system  processes  the  conmission  of  an  error  (a  recognition 
that  need  not  reach  the  subject's  awareness).  Given  the  relationship 
between  P300  and  schema  updating  (Donchin,  1981;  Karis,  Fabiani,  and 
Donchin.  1984),  we  hypothesized  that  in  the  present  experiment  this  process 
was  related  to  adjustments  in  the  subject's  strategy  subsequent  to  the 
recognition  of  an  error.  If  this  hypothesis  is  correct  then  the 
characteristics  of  the  P30Q  elicited  on  the  error  trials  should  predict 
variations  in  the  response  criteria  in  the  following  trials. 

To  test  this  hypothesis  we  used  the  following  procedure.  First,  we 
identified  the  male  trials  in  which  an  incorrect  response  was  given.  Then, 
we  sorted  these  trials  on  the  basis  of  the  response  made  on  the  following 
trial  on  which  a  male  name  was  presented.  The  rationale  was  the  following. 
We  assumed  that  when  an  incorrect  response  to  a  male  trial  was  given,  the 
subject  was  biased  to  emit  the  “female"  response  regardless  of  the  stimulus. 
If  the  subject  responded  again  incorrectly  to  the  following  male  trial ,  then 
we  assumed  that  the  subject's  bias  remained  the  same.  On  the  other  hand,  if 
the  subject  responded  correctly  to  the  following  male  trial,  then  we  assumed 
that  the  subject  had  revised  his  strategy.  We  labelled  the  latter 
sequences,  "switch"  sequences,  and  the  former,  “no-switch“  sequences.  Note 
that  we  assumed  that  the  switch  In  response  bias  occurred  as  a  consequence 
of  the  recognition  of  an  error  after  the  first  trial  of  a  sequence.  In 
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particular,  we  predicted  that  the  P300  to  the  first  stimulus  of  the  sequence 
(incorrect  male  name)  was  larger  for  "switch"  than  “no-switch"  sequences. 

For  both  the  20/80  and  50/50  conditions  P3QQ  elicited  on  the  first  trial  of 
a  “switch"  sequence  was  larger  than  P300  elicited  on  the  first  trial  of  a 
"no-switch"  sequence.  F(l,5)=6.66,  p< .05 .  and  F(1.5)*14.25,  p<.05, 
respectively.  To  test  further  the  hypothesis  that  the  switch  in  response 
bias  does  indeed  occur  inmediately  following  the  recognition  of  an  incorrect 
response  to  a  male  trial,  we  examined  the  female  trials  which  intervened 
between  the  first  and  last  male  trials  of  the  sequence  described  above.  The 
prediction  was  that  the  response  to  the  intervening  female  trial  should  be 
slower  for  "switch"  than  “no-switch“  sequences.  The  RT  for  these  trials  are 
shown  in  Figure  4.  as  a  function  of  sequence  (switch  vs  no-switch),  lag  from 
the  first  trial  of  the  sequence,  and  condition  (20/80  vs  50/50). 


Insert  Figure  4  About  Here 


Accuracy  is  higher  and  RTs  are  slower  for  female  trials  in  a  "switch" 
sequence  than  in  a  "no-switch"  sequence.  F(l. 61*13.36.  p< .05 .  F(1.5)=7.50, 
p<.05,  respectively.  This  indicates  that  the  subjects  did  indeed  modify 
their  response  strategy  at  the  beginning  of  a  "switch"  sequence. 

Oi scussion 

The  data  presented  above  indicate  that  when  a  subject  chooses  the  wrong 
alternative  in  a  two  choice  discrimination  task,  and  that  error  is  more  than 
likely  due  to  a  bias  to  respond  to  the  "wrong"  stimulus,  this  recognition 
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tends  to  introduce  a  delay  of  about  100  ms  In  the  invocation  of  the  P3Q0  by 
that  same  stimulus.  This  delay  in  P300  by  the  occurence  of  an  error  appears 
to  be  related  to  an  evaluation  by  the  system  of  the  context  within  which  it 
operates.  The  data  appear  consistent  with  the  suggestion  that  the  magnitude 
of  the  P300  can  serve  as  a  measure  of  the  degree  of  revision  in  the  system's 
biases.  This  assertion  is  inferred  from  the  fact  that  the  larger  the  P300 
elicited  following  the  error  the  less  likely  the  error  on  the  next 
error-prone  trial.  Moreover,  the  larger  the  P3QQ  that  is  elicited  on  error 
trials  the  slower  will  be  the  subject  to  respond  on  the  immediately 
succeeding  trial.  A  shift  in  response  bias,  or  a  tendency  to  place  the 
response  under  controlled,  rather  than  automatic,  processing  mode  are 
plausible  interpretations  of  these  data. 

These  results,  and  the  interpretation  proposed  above,  indicate  how  the 
P300  and  the  study  of  its  latency  and  amplitude  can  reveal  aspects  of  the 
manner  in  which  the  human  information  processing  system  deals  with  error 
trials.  These  aspects  are  opaque  to  the  more  traditional  tools  largely 
because  they  permit  a  view  of  information  processing  activities  that  do  not 
have  an  overt  mani festation  in  performance  on  the  trial  in  which  they  are 
elicited.  The  view  that  emerges  is  one  in  which  at  least  two  information 
processing  streams  proceed  in  parallel.  Both  depend  for  their  initiation  on 
the  initial  detection  and  encoding  of  the  stimulus.  However,  the  processing 
that  leads  to  the  overt  response  may  be  completed,- yielding  the  actual 
response.  Independently  of  the  evaluative  processes  whose  role  is  the 
maintenance  of  the  operating  environment. 

The  metaphor  that  captures  our  Intent  is  that  of  an  organization  whose 
operating  and  adnlni strati ve  arms  operate  In  a  highly  interactive,  but 
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nevertheless  Independent  manner.  Actions  by  the  organization's  staff  are 
taken  in  the  light  of  the  1 ~ cal  interpretation  of  ongoing  events  and  under 
the  constraints  established  by  the  administration's  policy.  Each  event 
outside  the  organization,  and  each  action  by  the  organization,  trigger  in 
the  administration  an  evaluative  process  that  may  long  outlast  the  staff's 
actions  as  the  administration  must  optimize  its  operating  policies  given  the 
consequences  of  its  own  actions  and  the  events  in  its  surround.  The  time 
course  of  the  administrative  processing  may  be  quite  independent  of  the  time 
course  of  the  processing  required  by  the  operating  staff  before  it  takes 
action.  Indeed,  if  staff  action  was  patently  erroneous  the  admini stration 
may  require  additional  time  before  it  closed  the  book  on  the  action,  files 
the  reports  and  makes  the  necessary  adjustments  in  policy. 

The  P300  component  can.  we  believe,  be  viewed  as  a  manifestation  of 
''admini  strati  ve“  rather  than  "operational1'  information  processing.  Oonchin 
et  al .  (1978)  labelled  these  classes  of  information  processing  "strategic" 
and  "tactical"  respectively.  Evidence  is  accumulating  that  the  process 
manifested  by  the  P3Q0  is  "future  oriented"  (see,  for  example.  Donchin. 

1981;  Fabiani.  Karis,  Coles,  S  Oonchin,  1983;  Karis  et  al.,  1984;  Klein. 
Coles,  S  Oonchin,  1984).  The  data  we  reviewed  in  this  chapter  are 
consistent  with  this  view.  It  seems  clear  that  the  magnitude  of  the  P30Q 
elicited  on  an  error  trial  Is  related  to  the  performance  of  the  subject  on  a 
subsequent  trial.  Such  an  effect  implies,  almost  by  definition,  that  the 
process  manifested  by  the  P300  has  consequences  for  future  performance.  It 
Is,  of  course,  possible  that  the  relationship  we  observed  is  fortuitous  and 
both  P300  amplitude  on  trial  M,  and  the  subject's  performance  on  trial  N+l, 
are  correlated  with  yet  a  third  factor  accounting  for  both  variations.  To 
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address  this  issue  we  must  continue  seeking  the  elucidation  of  the 
functional  significance  of  the  P300. 

In  many  ways  the  study  described  in  this  report  is  a  direct  descendant 
of  the  work  that  Lindsley  and  his  colleagues  undertook  as  the 
electroencephalographic  techniques  made  their  ways  from  Europe  to  the  United 
States  in  the  1930‘s.  The  EEG  has.  of  course,  become  a  standard  clinical 
tool  and  much  of  the  research  utilizing  the  EEG  is  clinical  in  nature  and  in 
orientation.  There  is,  however,  a  flourishing  research  enterprise  in  which 
the  EEG.  and  the  ERPs  embedded  in  it.  are  used  as  tools  in  the  study  of 
cognitive  function.  Lindsley1 s  work,  spanning  more  than  half  a  century  was, 
and  continues  to  be,  an  outstanding  illustration  of  the  way  a  scientist 
bringing  the  skills  and  sensibilities  of  a  psychologist  can  turn  the  record 
of  a  bodily  function  so  it  provides  a  window  on  the  mind.  A  key  element  in 
this  enterprise  is  the  chronometric  approach  that  hasbeen  so  important  in 
Lindsley' s  research  program. 
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Figure  Legends 

Figure  1.  Reaction  time  and  P300  latency  for  each  probability  (20/80 
and  50/50),  stimulus  (male  and  female)  and  response  (correct  -  C  -  and 
incorrect  -  I)  condition.  The  frequency  of  correct  and  incorrect  responses 
for  each  condition  is  also  indicated. 

Figure  2.  Grand  average  waveforms  at  Pz  for  each  probability,  stimulus 
and  response  condition.  The  solid  lines  refer  to  the  grand  average 
waveforms  for  the  correct  responses,  and  the  dashed  lines  to  the  waveforms 
for  the  incorrect  responses.  The  average  reaction  times  for  correct  (solid) 
and  incorrect  (dashed)  responses  are  indicated  by  vertical  lines. 

Figure  3.  Speed-accuracy  functions  for  rare  male  stimuli,  frequent 
female  stimuli  and  all  stimuli  in  the  50/50  condition.  Separate 
speed-accuracy  functions  were  computed  for  each  of  three  P30Q  latency  bins 
(600  to  699  ms,  700  to  799  ms,  and  800  to  899  ms). 

Figure  4.  Reaction  time  and  error  rate  for  female  trials  in  SWITCH  and 
NO  SWITCH  sequences,  as  a  function  of  the  lag  from  the  incorrect  male  trial 
initiating  the  sequences,  for  the  each  probability  condition. 
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Word  Lists 

(To  Remember  or  Jud^ei 

Priming 
und  N400 
Paradigms 

Signal 

Detection 

Sequence  generating  rule 

•\  random  or  constrained 
B  probability 

random 
rare  frequent 

randtim 

2  rare. 

random 

fares  in  some 

constrained 

random 

random 

Trials 

2  frequent 

experiments 

A  number  of  stimuli  and 
temporal  relationships 

1  number  oi  Mimuli 
per  trial 

2  among  trials  ( see ) 

B  stimuli 

1 

1-3 

1 

<  1 

1 

7 

2  or  more 

1  to  >  5 

2  or  more 

3  to  >  10 

i 

2-3 

I  modality 

-  ph)  steal  properties 

aud  v!  vis 

auditory 

visual 

visual 

auditory 

low 

intensity; 

detection 

dil'IicuU 

easily  disc. 

*  complexity,  structure. 
auJ  content 

Mapping  rule 

usually  simple 
Iwide  variety  ol 
stimuli  j 

simple  Uonesl 

moderately  complex 
(words! 

relationships 
present  among 
stimuli  (primarily 
words) 

simple 

(tones) 

moderately 
complex 
( words) 

A  memory  load 

B  nature  ol  processing  and 
transiormahon  required 

C  complexity  ot  response 

selection 
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usually  simple 

choice  RT  or  go/ 
nogo 

tow. 

selective  attention 

go'nogo 

usually  low 
complex 

choice  RT  or  nothing 

low 

priming  in 
semantic  memory 

choice  RT 

low 

simple 

choice  RT 

high 
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search  & 
comparison 
choice  RT 

Response  execution 

A  covert  or  overt 

B  discrete  or  continuous 
t  ballistic,  or  involving 

teed  hack 

either 

Usually  coven 

either 

overt 

overt 

overt 

ballistic 

no  response 

ballistic 

ballistic 

ballistic 

ballistic 

Table  2.  Experimental  Variables  that  Affect  P300 


Diluent  urn*  oj  the  Taxonomy 

Sequence  generating  rule 

A  random  or  constrained 
B  probability 

Trials 

A  number  of  stimuli  and  temporal 
relationships 

1  number  of  stimuli  per  trial 

2  among  tnals 

B  stimuli 

1  modality 

2  physical  properties 


H$UO  Amplitude 


constrained  smaller 
5-15  pv  effect  (rare  larger) 


probability  etlect  disappears  at 
long  IS  Is  { >  3  sec) 

visual  larger  (0-5  p.v) 


PJ(X)  Latency 


50-80  ms  difference  (rare 
longer) 


visual  longer  (100  ms)* 

50-150  ms  longer  with 
degraded  stimuli  or  in  noise 
conditions 


pjOO  Scalp  Distribution 


7 

no  difference 


Cz  is  more  negative  when  there 
is  a  warning  (and  thus  a  CNV) 


Cz  usually  more  positive  for 
visual? 

intensity  may  have  an  effect;  Cz 
may  be  =  or  >  Pz  at  low 
levels 


3  complexity,  structure  and  smaller  tor  complex,  but  longer  tor  more  complex  stimuli 

content  confounded  with  latency  (may  exceed  100  ms) 

3  Mapping  rule 

A  memory  load  requirements 
B  nature  of  processing  and 
transformation  required 


C  complexity  of  response  selection 

4  Response  execution 

A  covert  or  overt 
B  discrete  or  continuous 

C  ballistic,  or  involving  feedback 

5  Consequences  0-5  p.v  increase  with  bonus  tor  —  — 

correct  gucss/prediction 

h  Subject  characteristics  decreases  with  age  trom  Increases  with  age  from  more  frontal  with  age?  smaller 

maturity  maturity  (1-2  ms/year)  differences  among  Fz,  Cz,  Pz? 

*ln  i iii im  audit's  visual  stimuli  arc  more  complex  than  auditory  stimuli,  and  there  is  a  strung  elleci  of  complexity  on  P300  latency  (see  2  B  3) 


selective  attention  required;  memory  search  (slope  0-25  ms)  may  be  changes  due  to  feedback 

positive  (old)  items  5  f±v  or  depending  on  their  evaluation; 

more  larger  in  recognition  or  dual  tasks:  Pz  decreases  relative 

Sternberg  paradigms;  reduction  to  Cz 

(5-10  p.v)  with  increase  in 

workload  (in  dual  tasks);  0-5 

p.v  memory  effect 

—  —  increase  m  Cz  on  nogo  tnais? 
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used  (e  g  .  male  versus  female  names,  or  animals  versus  nonanimals).  To  give  an  exam-  tone  or  visual  pattern  once  every  second,  and  instructed  to  count  how  often  the  tone  or 

pie,  a  simple  classic  auditory  oddball  paradigm  might  involve  presenting  l(XK)  and  1 5(X)  pattern  fails  to  appear  Since  there  is  no  actual  physical  stimulus  present.  I.RPs  elicited  by 

Hz  tones  randomly,  wilh  the  I5(X)  Hz  lone  occurring  20%  of  ihe  time.  The  subject  would  the  counted  omission  can  contain  only  endogenous  components  The  results  ol  these  stud 

he  instructed  to  count  the  high  ( 15(H)  Hz)  tones  and  give  a  running  total  al  Ihe  end  of  a  jcs  were  important  in  the  early  arguments  that  IM(X)  was  indeed  an  endogenous  cornpo 

block  of  trials  (usually  IIX)  or  morel  nenl  Comparisons  between  missing  stimulus  potentials  (cimmv/  potentials)  and  stimulus 


H  H  ■=  1  5  £  si  £  J 

^  C  7 

—  —  — *  £  —  E 

4J  CXX.y;  2y“X-J 

=>  ■*  °-  -j  *-  T3  y  “*“  . 


£  E  -2  5-  £  u  s- 

c  -*  >L  -3  ~  c  ^ 

?  J  —  -5  -*  £  . 

1  »  x  -d  ^  |  = 

=  i  *  *  “  ^  5 

e  5  ;  -a  .!  e-  s 

b  “3  =  ^  £  c  ~ 


*  '  L 
C  c  3 


~  o  y 

-X  -O  =3 

-  w  "2 


1  5  £ 

n  C  _ 


^  E3 

£  £  JE 
..  ■£  to 


;c  = 

i1  04  x 

-  -  * 

u  ’“* 

SIC 

a.  v 

^  i  ■o 


oi  'j5  C  .  •*» 
y  c  3  Jr:  s 


y?  0C  — 

!1  ^  ^ 

“  1  = 

:|f! 

I  .E  _c 

•  75  > 

.  'J  0. 

I  .-a 

"-  -3 

O'  c  c 

—  C  03 


^  “  3 

i/i  "3  X) 


^  y  y. 

.'£  e> 

U  *  ^ 

oj  -3  r, 


■—  -J=  Q- 

C  y  o 
>.  ^  -c 


§  5!> 

•  5  5  c  wj  £ 


*  o  -5 

i;  *  i 


£  £  £  £ 


■  “  E  = 

r  t  i 


r 3  ^  O0 
w  c 

I  Si 

^  3 

I 

a.  a.  I 


r-  c  £ 

—  o  3 
Cl  ?■  r 

p  Ci 

7  0.= 
£ 

2  c  >^ 

*5  *  *5 
_>>  ^  •- 
§?  i  i 


y  o  y 
o  5 


2  .2  = 
3  — 


1 .2  £  a  -2  £ 


r3  *-»  £  ~  -c 

M  Ml 

ei  3  ^  ?  •-■ 

/  O  ‘J  C 

i!  £  =  g.£ 

=  *  f  s  “ 

—  f-  ?3  C  5 


5  «,  r 

^  c:  3 

=6  =  5 

2  3.  -R 


— •  3  *E  a. 

?3  3-  2 

C  OJ  —  T3 

r  ^  -  £ 

“  i  |  £ 

s  - 


c  w  ~  _ 

=  —  ■_  n 

u  £  Z  ~ 

.'  S  ;  J 


3  -C  £ 
c  -  ■£ 


Q_  St  3 

!*  15 

.  H  !  1 

I  s?  I  f 


E  a  y  ■ 

-  o  c 
2  £  £.- 


1-5  I  I 


I  I  “  J 

3  ■->  —  d 
£  j?  £  7 
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Abstract 

The  present  study  examined  mechanisms  of  response  activation  in  a 
choice  reaction  time  task.  Six  subjects  were  instructed  to  respond 
differentially  to  the  central  letter  of  one  of  four  arrays,  HHHHH,  SSHSS, 
SSSSS,  and  HHSHH,  which  were  presented  in  a  random  sequence.  Instructions 
emphasized  speed.  Measures  of  the  Event-Related  brain  Potential  (ERP),  and 
of  electromyographic  activity  (EMG)  were  obtained  on  each  trial. 

Conditional  accuracy  functions  (i.e.,  accuracy  conditional  on  latency) 
were  character ized  by  three  phases.  For  very  fast  responses,  accuracy  was 
at  chance  level  for  all  arrays,  suggesting  that  subjects  were  guessing.  For 
intermediate  latency  responses,  accuracy  was  above  chance  if  the  noise  was 
compatible  with  target,  and  below  chance  if  it  was  incompatible.  This 
pattern  of  data  suggests  that  these  responses  were  based  on  partial  stimulus 
analysis.  For  slow  responses,  accuracy  was  above  chance  for  all  arrays, 
suggesting  that  these  responses  were  based  on  complete  stimulus  analysis. 

Motor-related  brain  potentials  provided  further  information  about 
mechanisms  of  response  activation.  The  occurrence  and  the  accuracy  of  fast 
responses  could  be  predicted  by  examining  motor  potentials  preceding  the 
presentation  of  the  array.  Furthermore,  measures  of  the  motor  potentials  in 
the  period  following  the  presfintation  of  the  array  suggested  that  both 
responses  were  energized  by  array  presentation,  that  partial  analysis  of 
stimulus  information  could  activate  responses, 'and  that  the  level  of 
response  activation  at  the  time  of  the  triggering  of  the  EMG  response  was 
constant  for  trials  with  different  response  latencies.  The  data  are 
discussed  within  the  framework  of  a  response  channel  conception. 
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The  present  paper  reports  results  of  an  investigation  of  the  mechanisms 
that  lead  to  the  activation  of  responses  in  a  choice  reaction  time  (RT) 
paradigm.  It  is  usual  to  define  "responses'1  as  the  mechanical  events  (e.g., 
switch  closures)  that  are  recorded  by  the  experimenter.  However,  these 
events  are,  in  fact,  the  consequence  of  a  complex  chain  of  causally  related 
phenomena  that  can  be  traced  back  through  the  spinal  cord  and  the  motor 
areas  to  various  pre-setting  mechanisms.  While  closing  the  switch  is  the 
goal  imposed  on  the  subject  by  the  experimenter,  the  real  task  of  the 
subject  is  to  accomplish  the  movement  that  will  result  in  the  switch 
closure.  Thus,  to  understand  how  the  switch  comes  to  be  closed,  it  is 
arbitrary  to  choose  the  point  of  switch  closure  as  the  level  at  which  the 
analysis  should  be  conducted.  Indeed,  as  we  hope  to  illustrate,  we  are  led 
to  a  much  richer  understanding  of  the  processes  involved  in  a  choice  RT  task 
when  we  broaden  our  research  focus  to  include  different  levels  of  the 
response  process  rather  than  restricting  ourselves  to  the  switch  closure. 

Support  for  this  approach  can  be  found  in  studies  of  response 
preparation  and  response  competition,  which  have  both  been  interpreted  in 
terms  of  "subthreshold"  response  activation,  that  is,  activation  of  the 
response  system  which  is  not  sufficient  to  trigger  an  overt  response.  Thus, 
response  preparation  can  be  conceptualized  as  an  energizing  phenomenon 
(Posner,  1978;  Requin,  1985),  by  which  response  structures  are  activated  at 
a  subthreshold  level.  Indeed,  when  Requin  (198b)  recorded  from  the  motor 
cortex  of  monkeys,  he  observed  an  increase  in  the  firing  rate  of  neurons 
during  the  foreperiod  of  RT  tasks.  Some  of  these  neurons  showed  a  further 
increase  in  firing  rate  during  the  execution  of  the  overt  response. 

Response  competition,  on  the  other  hand,  can  be  viewed  as  the  reciprocal 
inhibition  of  competing  response  structures  (see  Eriksen  &  Schultz,  1979; 
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Sherrington,  1906),  in  which  the  suothreshold  activation  of  one  response 
leads  to  a  delay  in  the  execution  of  another.  This  interpretation  of 
response  competition  has  been  supported  by  the  results  of  studies  in  which 
electromyographic  (EMG)  responses  were  recorded  in  association  with  overt 
responses  (see  Coles,  Gratton,  Bashore,  Eriksen,  &  Donchin,  1985;  Eriksen, 
Coles,  Morris,  &  O'Hara,  1985). 

The  concept  of  response  channels 

The  idea  that  there  can  be  various  levels  of  response-related  activity 
in  the  nervous  system  can  be  conceptualized  in  terms  of  the  notion  of 
"response  channel."  We  use  this  term  as  a  heuristic  device  to  refer  to 
that  complex  of  structures  whose  activities  are  more  or  less  directly 
related  to  the  mechanical  event  which  is  defined  as  the  overt  response.  A 
similar  conception  can  be  found  in  Gaillard  (1978)  and  Naatanen  and  Merisalo 
(1977). 

We  consider  the  response  channels  to  be  distinct  from  the  stimulus 
evaluation  system,  whose  function  is  to  analyze  stimulus  information, 
resulting  in  activation  of  the  appropriate  response  channel.  However,  the 
level  of  activity  of  the  response  channel  may  be  affected  by  other  factors 
such  as  response  priming  or  bias  and  response  competition. 

Within  this  framework,  different  degrees  of  involvement  of  response 
structures  are  viewed  as  different  degrees  of  response  channel  activation. 
Thus,  response  preparation  can  be  represented  as  an  increase  in  the  level  of 
activation  of  a  response  channel,  while  response  competition  is  a  decrease 
in  the  level  of  activation  in  one  response  channel  as  a  result  of  the 
increase  in  the  level  of  activation  of  another  channel.  The  involvement  of 
particular  response  structures  can  be  represented  by  different  thresholds  of 
response  channel  activation.  Thus,  there  is  a  threshold  for  the  initiation 
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of  muscle  activity,  as  revealed  in  the  EMG ,  and  a  higher  threshold  for  the 
initiation  of  an  overt  movement  (see  Coles  et  al.,  1985). 

Measuring  the  Activation  of  Response  Channels 

Information  about  the  level  of  activation  of  the  response  channels  can 
be  gained  by  studying  the  activity  of  neural  and  muscular  structures 
involved  in  the  response,  as  well  as  by  examining  the  overt  response.  In 
our  previous  work  (Coles  et  al..  1985;  Eriksen  et  al . ,  1985),  we  recorded 
both  the  EMG  activity  and  the  overt  response  (a  squeeze  of  a  dynamometer)  to 
determine  the  time  at  which  the  level  of  response  channel  activation  reached 
two  particular  thresholds  (that  for  the  EMG  response  and  that  for  the 
squeeze  response).  While  this  procedure  allowed  us  to  identify  two  points 

in  the  response  channel  activation  function,  it  did  not  provide  a 

* 

continuous,  analog  description  of  the  function. 

In  the  present  study,  we  used  a  component  of  the  event-related  brain 
potential  (ERP),  the  readiness  potential,  as  a  measure  of  response  channel 
activation.  This  component  was  first  described  by  Kornhuber  and  Deecke 
(1965)  in  a  study  of  self-paced  movements.  These  authors  recorded 
electrical  activity  from  scalp  electrodes  and  observed  a  negative,  ramp-like 
potential,  that  began  some  800  ms  prior  to  the  execution  of  a  movement  and 
peaked  just  after  the  onset  of  the  movement.  Later  research  has  shown  that 
this  potential  is  largest  at  scalp  electrodes  placed  over  the  hemisphere 
contralateral  to  the  hand  responsible  for  the  movement  (e.g.,  Kutas  & 
Donchin,  1974,  1977,  1980;  Vaughan,  Costa,  &  Ritter,  1972).  Furthermore, 
several  investigators  (e.g.,  Kutas  &  Donchin,  1980;  Rohrbaugh,  Syndulko,  & 
Lindsley,  1976)  have  described  a  similar  lateralized  potential  that  develops 
during  the  foreperiod  of  simple  warned  RT  tasks.  The  peak  of  this 
lateralized  potential,  which  occurs  following  the  imperative  stimulus. 
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appears  to  be  time-locked  to  the  overt  response.  A  similar  lateral ized 
potential  develops  after  the  imperative  stimulus  in  choice  RT  tasks,  when 
response  choice  depends  on  information  provided  by  the  imperative  stimulus, 
but  before  the  imperative  stimulus  when  the  choice  depends  on  information 
provided  by  the  warning  stimulus  (Kutas  and  Oonchin,  1980).  Thus,  the 
potential  appears  to  become  lateralized  when  a  choice  is  made  about  the 
responding  hand. 

These  observations  led  Kutas  and  Donchin  to  argue  that  the 
lateral ization  of  the  readiness  potential  can  be  viewed  as  an  index  of 
specific  motor  preparation. 1  Subsequent  reports  (e.g.,  Okada,  Williamson,  & 
Kaufman,  1982),  that  have  used  neuromagnet ic  techniques  to  relate  this 
potential  to  motor  cortex  activity,  also  lend  credibility  to  the  suggestion 
that  the  readiness  potential  is  a  good  candidate  for  a  psychophysiological 
measure  of  response  channel  activation.  Therefore,  we  propose  to  use  the 
lateralization  of  the  readiness  potential  as  a  measure  of  the  relative 
activation,  or  priming,  of  response  channels  prior  to  stimulus  presentation. 
Further  variations  in  the  lateralization  of  the  readiness  potential 
occurring  af ter  the  presentation  of  the  stimulus  should  reflect  the  relative 
activation  of  the  response  channels  as  a  consequence  of  the  processes  of 
stimulus  evaluation. 

Pre-stimulus  activation 

Pre-stimulus  activation  refers  to  activation  of  a  response  channel 
prior  to  the  presentation  of  the  imperative  stimulus.  When  pre-stimulus 
activation  occurs,  the  corresponding  response  should  be  facilitated,  because 
the  activation  level  at  the  time  of  stimulus  presentation  is  closer  to  the 
overt  response  threshold.  In  extreme  cases  of  pre-stimulus  activation, 
small  increases  in  activation  that  may  occur  as  a  consequence  of  mere 
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stimulus  presentation  are  sufficient  to  raise  the  activation  level  above  the 
threshold  and  trigger  the  response.  In  this  way,  the  probability  of 
emitting  a  response  in  the  absence  of  stimulus  information  will  be  enhanced. 
Responses  of  this  type  are  commonly  labelled  “fast  guesses."  According  to 
this  view,  then,  fast  guesses  occur  when  there  is  an  increased  level  of 
response  channel  activation  prior  to  the  imperative  stimulus.  Such  a 
hypothesis  is  compatible  with  explanations  of  the  effects  of  instructional 
sets,  priming  or  warning  stimuli,  and  manipulations  of  response  probability, 
on  the  accuracy  and  latency  of  responses  (see,  for  instance,  Posner,  1978). 
More  generally,  this  hypothesis,  the  variable  baseline  hypothesis, 
interprets  variability  in-response  bias  in  terms  of  variability  in  the 
baseline  level  of  response  channel  activation. 

This  hypothesis  differs  from  that  proposed  by  Grice,  Nullmeyer,  and 
Spiker  (1982)  who  argued  that  fast  guesses  are  a  consequence  of  variations 
in  the  setting  of  the  response  criterion.  According  to  this  "variable 
criterion  hypothesis,"  fast  guesses  occur  when  the  response  criterion  is  set 
to  such  a  low  level  that  small  fluctuations  in  the  activation  of  the 
response  channel  (occurring  after  the  stimulus)  are  sufficient  to  cross  the 
criterion,  or  threshold,  and  trigger  the  response. 

We  propose  that  measures  of  the  lateral ized  readiness  potential  can  be 
jjsed  to  evaluate  these  two  hypotheses.  In  particular,  if  trials  on  which  a 
fast  guess  occurs  are  associated  with  a  greater  pre-stimulus  activation  than 
slow  response  trials,  then  we  should  observe  a  larger  lateralized  readiness 
potential  during  the  foreperiod  of  fast  guess  trials.  On  the  other  hand,  if 
fast  guesses  occur  when  the  response  criterion  is  set  to  a  lower  level  than 
for  slow  response  trials,  there  should  be  less  lateralization  of  the 
readiness  potential  at  the  time  the  overt  response  is  initiated  on  fast 
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guess  trials. 

Post-st imu lus  activation 

Post-stimulus  activation  refers  to  the  activation  of  the  response 
channels  that  occurs  as  a  consequence  of  the  processing  of  the  imperative 
stimulus.  In  our  previous  work  (Coles  et  al.,  1985,  Eriksen  et  al.t  1985), 
we  have  found  evidence  to  suggest  that  responses  can  be  activated  in  a 
parallel  and  continuous  fashion  as  stimulus  evaluation  proceeds.  We  used  a 
choice  RT  paradigm  in  which  the  stimulus  was  made  up  of  a  central  target 
letter  flanked  by  noise  letters  (cf.  B.  Eriksen  &  C.  Eriksen,  1974).  The 
noise  letters  could  either  call  for  the  same  response  as  the  target 
(compatible  noise)  or  for  the  opposite  response  (incompatible  noise). 
Analysis  of  EMG  and  overt  response  measures  revealed  that,  on  some  trials, 
both  responses  were  initiated  (Coles  et  al.,  1985;  Eriksen  et  al.,  1985). 
These  double  responses  occurred  more  often  when  incompatible  noise  was 
presented,  suggesting  parallel  processing  of  target  and  noise  information. 
Furthermore,  conditional  accuracy  functions  (i.e.,  accuracy  conditional  on 
latency)  revealed  that  when  subjects  responded  quickly  to  incompatible  noise 
stimuli,  they  tended  to  make  errors.  Indeed,  they  erred  more  than  would  be 
predicted  on  the  basis  of  chance  responding  (Coles  et  al.,  1985).  On  the 
other  hand,  slow  responses  to  incompatible  noise  stimuli  were  associated 
with  high  accuracy.  The  accuracy  of  responses  on  compatible  noise  trials 
was  always  greater  than  chance.  These  data  suggest  that  responses  are 
sometimes  given  on  the  basis  of  a  preliminary  analysis  of  stimulus 
information,  whose  output  is  dominated  by  the  noise  letters.  In  the  case  of 
incompatible  noise,  the  output  of  this  preliminary  analysis  leads  to  the 
incorrect  response,  since  it  is  dominated  by  the  noise  letters. 

These  data  support  the  thesis  that  response  channels  can  be  activated 
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by  partial  analysis  of-stimulus  information.  Therefore,  the  lateralization 
of  the  readiness  potential  in  the  period  following  the  presentation  of  the 
imperative  stimulus,  and  preceding  the  overt  response,  should  provide 
evidence  of  the  influence  of  this  partial  analysis.  In  particular,  if  the 
imperative  stimulus  contains  incompatible  noise,  we  should  observe 
activation  of  the  incorrect  response. 

The  present  experiment 

We  have  proposed  that  several  factors  can  affect  the  activation  of 
response  channels.  These  include  pre-  and  post-stimulus  response 
activation.  Furthermore,  we  have  argued  that  the  lateral i zed  readiness 
potential  can  be  used  as  a  measure  of  the  relative  activation  of  response 
channels. 

These  ideas  were  explored  in  the  present  experiment.  In  particular,  we 
used  electrophysiological  measures  to  investigate  the  mechanisms  responsible 
for  fast  guesses,  and  to  provide  a  description  of  the  influence  of 
evaluation  processes  on  the  response  system.  To  this  end,  subjects 
performed  a  version  of  the  noise/compatibility  task  (cf.  Coles  et  al., 

1985).  They  were  instructed  to  respond  quickly,  and  a  large  number  of 
trials  were  collected,  so  as  to  provide  a  reasonable  sample  of  fast  guesses. 
Thus,  in  the  present  experiment  we  replicated  the  basic  design  of  the  Coles 
et  al.  (1985)  study.  However,  rather_than  focussing  on  the  temporal 
relationship  between  two  measures  of  peripheral  response  activation  (EMG  and 
overt  movement)  we  concentrated  on  a  more  central  measure  of  response 
activation.  In  particular,  we  recorded  the  readiness  potential  from  lateral 
scalp  electrodes  to  derive  an  analog  measure  of  the  relative  activation  of 
response  channels. 
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Method 

Subjects 

Six  graduate  students  (2  females)  at  the  University  of  Illinois,  were 
paid  $3.50  an  hour,  plus  bonuses,  for  participation.  The  subjects  (between 
23  and  25  years  old)  had  normal,  or  corrected  to  normal,  vision  and  hearing. 
The  subjects  were  not  informed  of  the  precise  purposes  of  the  experiment. 
Stimul i 

Each  trial  was  initiated  by  a  warning  tone  (1000  Hz,  50  ms  duration,  65 
dB  amplitude),  generated  by  a  Schlumberger  sine-square  audio  generator 
(model  SG-18A)  and  administered  binaurally  through  headphones.  One  thousand 
ms  after  the  warning  tone,  a  five  letter  array  (HHHHH,  SSHSS,  SSSSS,  or 
HHSHH)  was  presented  on  a  DEC  VT -11  CRT  display  for  100  ms.  The  interval 
between  two  consecutive  trials  varied  randomly  between  3500  and  5500  ms. 

The  subject  sat  facing  the  screen  at  a  distance  of  one  meter,  such  that  the 
angle  subtended  by  each  letter  was  approximately  0.5  degrees.  Thus,  the 
angle  subtended  by  the  whole  array  was  approximately  2.5  degrees.  A 
fixation  dot,  placed  .1  degrees  above  the  location  of  the  target  letter, 
remained  visible  throughout  the  experiment. 

Procedure 

Each  subject  took  part  in  one  practice  and  three  experimental  sessions. 
The  subjects  received  17  blocks  of  trials  during  each  of  the  four  sessions. 
In  the  first  15  blocks  (made  up  of  80  trials  each),  one  of  the  four  arrays 
was  presented  on  each  trial.  In  the  last  two  blocks  (made  up  of  40  trials 
each),  one  of  only  two  arrays  (either  HHHHH  and  SSHSS,  or  SSSSS  and  HHSHH) 
was  presented  on  each  single  trial.  Thus,  in  the  last  two  blocks  the 
central  stimulus  was  always  the  same,  making  these  blocks  equivalent  to  a 
"simple  RT"  task  rather  than  the  "choice  RT"  task  used  in  the  first  15 
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bTocks.2  In  each  case,  the  stimulus  arrays  were  equiprobable  (.25  for  each 
stimulus  array  in  the  first  15  blocks,  .50  in  the  last  two  blocks),  and 
their  presentation  order  was  randomized.  Each  subject  was  instructed  to 
respond  to  one  of  the  two  target  letters  (H  or  S)  with  one  hand  (left  or 
right),  and  to  the  other  target  letter  with  the  other  hand,  by  squeezing  a 
zero-displacement  dynamometer.  The  association  between  target  letter  and 
responding  hand  was  consistent  for  each  subject  and  counterbalanced  across 
subjects.  Speed  was  emphasized  over  accuracy  and  subjects  were  trained,  by 
means  of  verbal  feedback,  to  respond  at  a  speed  which  produced  error  rates 
that  ranged  between  .10  and  .20. 

Overt  Response  Measurement 

When  subjects  squeezed  the  zero-displacement  dynamometers  (Daytronic 
Linear  Velocity  Force  Transducers,  Model  152A,  with  Conditioner  Amplifiers, 
Model  830A,  see  Kutas  &  Oonchin,  1977),  a  voltage  was  generated  which  was 
proportional  to  the  force  applied  to  the  transducer.  This  signal  was 
digitized  at  100  Hz  for  1000  ms  following  array  presentation,  giving  a 
continuous  recording  of  the  force  output  of  both  hands  following  each 
stimulus.  A  Schmitt-trigger  could  be  set  to  any  preselected  force  level. 
When  the  force  reached  a  prescribed  criterion,  the  system  recorded  the 
occurrence  of  an  overt  "criterion"  response.  Before  the  practice  session, 
the  value  of  each  subject's  maximum  squeeze  force  was  determined  for  each 
hand  separately.  Criterion  values  for  each  subject  were  set  at  25*  of  the 
maximum  force  applied  by  that  subject  for  that  hand.  During  the  practice 
sessions  only,  a  click  was  presented  over  a  loud-speaker  whenever  the  force 
exerted  on  the  transducer  crossed  the  criterion. 
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Psychophysiological  Recording 

The  electroencephalogram  (EEG)  was  recorded  from  Fz,  Cz,  Pz,  (according 
to  the  10/20  system,  Jasper,  1958),  C3 '  (4  cm  to  the  left  of  Cz),  and  C4 '  (4 
cm  to  the  right  of  Cz)  referenced  to  linked  mastoids,  using  Burden  Ag/AgCl 
electrodes.  Vertical  and  horizontal  electrooculographic  activity  (EOG)  was 
recorded  using  Beckman  biopotential  Ag/AgCl  electrodes  placed  above  and 
below  the  right  eye,  and  at  2  cm  external  to  the  outer  canthus  of  each  eye. 
Ground  electrodes  were  placed  on  the  forehead.  The  EMG  was  recorded  by 
attaching  pairs  of  Beckman  electrodes  on  both  right  and  left  forearms,  using 
standard  forearm  flexor  placements  (Lippold,  1967).  For  EEG  and  EOG 
electrodes  the  impedance  was  less  than  5  KOhm,  for  EMG,  below  20  KOhm.  The 
EEG  and  EOG  signals  were  amplified  by  Grass  amplifiers  (model  7P122),  and 
filtered  on-line  using  a  high  frequency  cut-off  point  at  35  Hz  and  a  time 
constant  of  8  s  for  the  high  pass  filter.  The  EMG  signals  were  conditioned 
using  a  Grass  Model  7P3B  preamplifier  and  integrator  combination.  The 
preamplifier  had  a  half  amplitude  low  frequency  cut-off  at  0.3  Hz,  while  the 
output  of  the  integrator  (full  wave  rectification)  was  passed  through  a 
filter  with  time  constant  of  0.05  sec.  For  each  psychophysiological  measure 
(EEG,  EOG,  and  EMG)  and  each  trial,  the  derived  voltages  were  digitized  at 
100  Hz  for  2100  ms,  starting  100  ms  prior  to  the  presentation  of  the  warning 
tone,  and  ending  1000  ms  after  the  presentation  of  the  array. 

Data  Reduction 

Motor  responses.  As  we  noted  above,  the  subjects  were  required  to 
squeeze  the  dynamometers  to  a  criterion  of  at  least  25%  of  maximum  force  to 
register  a  "criterion  response."  This  response  criterion  was  used  for 
on-line  feedback  and  RT  was  defined  as  the  latency  at  which  this  criterion 
was  crossed.  However,  a  different  response  classification  system  was  used 
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for  most  of  the  analyses.  In  particular,  a  measure  based  on  the  onset 
latency  of  the  EMG  response  was  used. 

The  onset  latency  of  the  EMG  response  was  evaluated  on  each  trial 
according  to  the  following  procedure  (cf.  Coles  et  al.,  1985).  First,  a 
minimum  criterion  value  was  established  for  each  subject  to  discriminate  the 
EMG  response  from  random  variations  in  background  EMG.  Since  the  background 
EMG  was  typically  characterized  by  very  small  amplitude  activity,  the  EMG 
response  was  readily  identifiable.  When  the  integrated  EMG  exceeded  the 
minimum  criterion,  an  EMG  response  was  defined  as  having  started  and  the 
latency  of  this  activity  was  noted.  EMG  responses  in  both  arms  could  be 
observed  on  the  same  trial.  -In  the  present  paper,  we  will  focus  on  analyses 
based  on  the  onset  latency  of  the  first  EMG  response. 

ERP  measures.  For  each  single  trial,  the  EEG  data  were  corrected  for 
both  vertical  and  horizontal  ocular  movement  artifacts  using  a  modification 
of  the  procedure  described  in  Gratton,  Coles,  and  Donchin  (1983).  The 
corrected  single  trial  data  were  then  stored  for  further  analysis,  including 
computation  of  averages  and  analysis  of  ERP  component  parameters  on  each 
single  trial.  For  the  single  trial  analysis,  the  data  from  the  five  scalp 
electrodes  (Fz,  Cz,  Pz,  C3 ’  and  C41)  were  smoothed  using  a  low  pass  digital 
filter  (high  frequency  cut-off  point  at  3.14  Hz,  two  iterations),  and  the 
baseline  level  was  subtracted  by  averaging  the  f4rst  ten  points  of  the  epoch 
(corresponding  to  100  msec). 

The  readiness  potential  was  assessed  using  the  ERP  waveforms  recorded 
at  C 3 ’  and  C4 ' .  These  electrodes  are  placed  on  scalp  regions  close  to  brain 
motor  areas.  Previous  research  has  shown  that  the  amplitude  of  the 
readiness  potential  is  maximum  at  these  locations  when  squeeze  responses  are 
required  (Kutas  &  Donchin,  1977;  Bashore,  McCarthy,  Heffley,  Clapman,  & 
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Donchin,  1982). 

Results 

The  results  section  will  be  divided  into  several  parts.  First,  we 
describe  the  basic  conditional  accuracy  functions  for  the  two 
noise/compatibility  conditions.  Second,  we  present  data  concerning  the 
influence  of  pre-stimulus  response  activation  mechanisms  on  the  latency  and 
accuracy  of  overt  responses.  Third,  we  analyze  the  mechanisms  of 
post-stimulus  activation  by  studying  the  influence  of  stimulus  evaluation  on 
the  relative  activation  of  the  two  responses.  Fourth,  we  consider  the 
possibility  that  responses  are  released  when  relative  response  activation 
reaches  a  fixed  criterion.  Finally,  we  describe  an  analysis  of  the 
absolute,  rather  than  relative,  activation  of  each  response. 3 
Conditional  Accuracy  Functions 

Conditional  accuracy  functions  were  derived  for  compatible  and 
incompatible  conditions,  and  for  each  of  the  six  subjects,  by  computing 
response  accuracy  for  each  of  seven  50  ms  response  latency  bins  ranging  from 
100  to  449  ms  (cf.  Lappin  &  Ditsch,  1972a,  1972b;  Pachella,  1974).  The 
accuracy  and  latency  of  each  "response"  were  defined  either  in  terms  of  the 
first  criterion  squeeze  (Figure  1,  left  panels)  or  onset  of  the  first  EMG 
response  (Figure  1,  right  panels).  Average  functions  are  shown  in  the  tipper 
panels  of  Figure  1,  while-fehe  proportions  of  responses  for  each  latency  bin 
are  shown  in  the  lower  panels. 


Insert  Figure  1  About  Here 


The  conditional  accuracy  functions  based  on  the  two  response  definitions  are 
qualitatively  similar,  suggesting  that  both  measures  are  manifestations  of 
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the  activity  of  the  same  underlying  system.  However,  there  are  many  fewer 
criterion  squeeze  responses  than  EMG  responses  with  a  latency  shorter  than 
200  ms.  (In  fact,  only  three  subjects  had  criterion  squeeze  responses  with 
a  latency  less  than  150  ms.)  This  supports  the  view  that  EMG  responses 
represent  a  more  sensitive  measure  of  response  activation  than  the  squeeze 
response  (see  also  Coles  et  al.,  1985;  Eriksen  et  al.,  1985).  Furthermore, 
EMG  responses  are  less  affected  by  response  competition  than  are  squeeze 
responses  (see  Coles  et  al.,  1985  and  footnote  3).  These  two 
characteristics  of  the  EMG  onset  measure  make  it  more  suitable  than  the 
criterion  squeeze  response  as  an  index  of  peripheral  response  activation. 
Therefore  we  used  the  EMG  measure  in  the  remainder  of  this  paper  as  the 
basis  for  response  classification. 

Analysis  of  the  conditional  accuracy  functions  based  on  EMG  onset 
revealed  that  the  proportion  of  correct  responses  increased  with  increasing 
response  latency,  F(6,30)  =  59.78,  £<.001,  and  that  subjects  were  more  often 
correct  on  compatible  than  incompatible  trials,  F(l,5)  =  144.40,  £<.001. 

The  most  interesting  aspect  of  these  functions,  however,  was  that  they 
differed  in  shape,  as  reflected  in  the  significant  interaction  between 
compatibility  and  response  latency,  F(6,30)  =  9.96,  £<.001.  As  can  be  seen 
in  Figure  1,  the  most  noticeable  difference  between  the  functions  occurred 
at  response  latencies  of  between  150  and  249  ms.  For  responses  in  the  ISO- 
199  and  200-249  latency  bins,  accuracy  was  significantly  greater  than  chance 
(.50)  for  compatible  arrays,  t(5)=4.25  and  15.72  (one-tailed),  but 
significantly  below  chance  for  Incompatible  arrays,  t(5)=-3.47  and  -2.34 
(one-tailed).  This  differential  pattern  of  response  accuracy,  which  was 
evident  in  all  subjects,  suggests  that,  for  these  response  latencies,  the 
noise  letters  are  controlling  response  accuracy  (cf.  Coles  et  al.,  1985). 
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Since  the  noise  letter  in  incompatible  arrays  calls  for  the  incorrect 
response,  responses  for  these  arrays  tended  to  be  less  accurate  than  chance. 
On  the  other  hand,  very  fast  responses  (with  a  latency  of  less  than  150  ms), 
tended  to  have  a  chance  level  of  accuracy  regardless  of  the  compatibility  of 
the  array.  For  compatible  and  incompatible  arrays,  response  accuracy  was 
not  significantly  different  from  chance  (50%),  t(5)  =  0.80  and  -1.22, 
respectively.  Since  the  accuracy  of  these  responses  was  unaffected  by 
stimulus  information,  they  could  be  classified  as  "fast  guesses." 

Conversely,  slow  responses,  with  a  latency  greater  than  300  ms,  tended  to  be 
correct  regardless  of  the  compatibility  of  the  array.  As  we  have  argued 
previously,  the  accuracy  of  these  slower  responses  was  controlled  by 
analysis  of  target  letter  information  (cf.  Coles  et  al.,  1985). 

Pre-stimulus  activation 

In  this  section,  measures  of  the  lateralized  readiness  potential  are 
used  to  describe  the  pattern  of  response  activation  processes  that  occurs 
during  the  foreperiod  (between  warning  tone  and  array  presentation).  In 
particular,  we  focus  on  the  question  of  whether  the  accuracy  of  fast  guess 
responses  is  under  the  control  of  an  activation  process  that  occurs  before 
array  presentation.  Such  an  analysis  should  reveal  whether  these  fast 
responses  are  associated  with  a. higher  baseline  level  of  response 
activation,  as  predicted  by  the  variable  baseline  hypothesis. 

To  examine  the  activation  process,  we  computed  the  difference  in 
voltage  between  the  two  laterally  placed  electrodes  (C3‘  and  C4 ' )  for  every 
time  point  on  each  trial  beginning  100  ms  before  tone  onset  and  extending 
through  the  time  of  array  presentation.  As  we  noted  earlier,  the  potential 
at  these  electrode  sites  is  more  negative  on  the  side  contralateral  to  the 
responding  hand,  when  subjects  are  informed  which  hand  to  use  to  respond 
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(see  Kutas  &  Donch-in,  1980).  Thus,  we  predicted  that  the  direction  and 
degree  of  laterality  observed  during  the  foreperiod  would  be  related  to  the 
nature  of  the  response  to  the  array.  The  values  of  lateralization  were 
computed  on  the  basis  of  which  hand  happened  to  be  correct  on  a  particular 
trial,  so  that  negative  values  (upward  deflections  in  Figures  2,  4,  and  5) 
always  indicate  more  negativity  at  the  electrode  site  contralateral  to  the 
correct  response. 

To  compare  lateralization  waveforms  preceding  fast  and  slow  responses, 
we  sorted  trials  according  to  EMG  onset  latency  (using  50  ms  bins  ranging 
from  100  to  399  ms),  response  accuracy  (correct  or  incorrect  response)  and 
—noise/compatibility  (compatible  or  incompatible  noise).  To  obtain  stable 
estimates  of  the  lateral ized  brain  potential,  a  data  base  of  at  least  50 
trials  must  be  used  to  derive  average  waveforms.  In  fact,  the  shortest  EMG 
onset  latency  bin  (100  to  149  ms)  for  all  types  of  trial,  and  later  EMG 
onset  latency  bins  for  incorrect  compatible  trials,  did  not  match  the  50 
trial  criterion.  For  this  reason  we  had  to  restrict  our  analysis  to  the 
remaining  cells  (EMG  onset  latency  bins  between  150  and  399  ms  for  correct 
compatible  trials,  and  for  correct  and  incorrect  incompatible  trials).  Note 
that  the  shortest  EMG  onset  latency  bin  for  which  average  lateralization 
waveforms  were  computed  was  150  to  199  ms.  Although  the  conditional 
accuracy  data  indicated  that  fast  guess-trials  were  most  evident  in  the  100 
to  149  ms  bin,  a  substantial  number  of  the  trials  in  the  150  to  199  ms  bin 
are  also  fast  guesses,  since  accuracy  is  still  quite  close  to  chance  for 
this  bin. 

Figure  2  shows  average  laterality  values  from  incompatible  trials 
during  the  foreperiod  for  correct  and  incorrect  response  trials  with 
different  response  latencies  (150-199  ms  in  the  lower  panel  and  300-349  ms 
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in  the  upper  panel).  As  with  the  conditional  accuracy  functions,  we  used 
the  EMG  response  to  define  both  response  accuracy  and  response  latency. 


Insert  Figure  2  About  Here 


Inspection  of  Figure  2  suggests  that  the  readiness  potential  preceding 
fast  responses  becomes  lateral ized  during  the  foreperiod  and  that  the 
direction  of  the  laterality  is  related  to  the  correctness  of  the  subsequent 
response.  This  inference  is  supported  by  several  analyses.  First,  when 
laterality  is  defined  in  terms  of  the  mean  absolute  value  of  the  C3'-C4' 
difference  in  the  last  100  ms  of  the  foreperiod,  fast  responses  are 
associated  with  a  greater  degree  of  laterality  than  slow  responses  (the 
analysis  was  restricted  to  correct  responses  and  to  latency  bins  ranging 
between  150  and  399  ms  because  of  unstable  ERP  data  on  other  bins),  F(4, 

20 )  =  3 . 31 ,  £<.05.  Further  analysis  revealed  a  significant  linear  trend  in 
lateralization  as  a  function  of  EMG  onset  latency  bin,  F(l,  20)=10.45, 
£<.01.  Second,  the  direction  of  the  laterality  is  related  to  response 
accuracy.  Fast  correct  responses  are  associated  with  laterality  in  the 
direction  of  the  correct  response  (greater  negativity  at  the  electrode 
location  contralateral  to  the  correct  response),  while  fast  incorrect 
responses  are  associated  with  laterality  in  the -direction  of  the  incorrect 
response  (greater  negativity  at  the  electrode  location  ipsilateral  to  the 
correct  response),  £(1,  5 ) = 19 . 54 ,  £<.01.  (This  analysis  was  restricted  to 
incompatible  trials,  because  there  were  not  enough  incorrect  responses  to 
compatible  arrays  to  yield  stable  averages.) 

Finally,  we  classified  each  individual  trial  into  one  of  three 
categories:  trials  for  which  the  readiness  potential  was  larger  over  the 


Response  channels  and  psychophysiology 


Page  19 


motor  cortex  contralateral  to  the  correct  response  ("contralateral"  trials); 
trials  for  which  the  readiness  potential  was  larger  over  the  motor  cortex 
ipsi lateral  to  the  correct  response  ("ipsi lateral"  trials);  and  trials  for 
which  there  was  no  significant  difference  between  the  two  sides 
(" non-lateralized"  trials). 4  We  computed  the  conditional  accuracy  functions 
separately  for  each  of  these  categories  of  trials.  The  average  conditional 
accuracy  functions  for  compatible  and  incompatible  trials  computed  in  this 
way  are  shown  in  Figure  3. 


Insert  Figure  3  About  Here 


We  predicted  that  the  accuracy  of  fast  guesses  would  be  greater  than 
chance  when  the  correct  response  was  prepared  in  advance  of  stimulus 
presentation  -  that  is,  for  contralateral  trials  -  and  below  chance  when  the 
incorrect  response  was  prepared  in  advance  of  stimulus  presentation  -  that 
is,  for  ipsilateral  trials.  This  prediction  was  confirmed.  An  analysis  of 
variance  conducted  on  the  conditional  accuracy  functions,  for  which  accuracy 
was  used  as  a  dependent  variable  and  noise  compatibility,  response  latency 
bin  and  trial  category  were  used  as  factors,  indicated  a  significant  main 
effect  of  trial  category,  F(2,10)  =  5.66,  £  <.05,  as  well  as  a  significant 
2-way  interaction  between  trial  category  and  response  latency  bin,  F(10,50) 

=  2.88,  £  <.01.  Subsequent  analyses  revealed,  for  the  main  effect  of  trial 
category,  a  significant  linear  trend  with  decreases  in  accuracy  from 
Contralateral  to  Non-lateralized  to  Ipsilateral  trials,  and,  for  the 
interact  ion,  that  this  linear  trend  was  significant  only  for  the  first 
response  latency  bin  (between  100  and  149  ms). 

Taken  together,  these  data  suggest  that  subjects  sometimes  chose  and 
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activated  a  particular  response  during  the  foreperiod  before  the  array  was 
presented,  although,  objectively,  each  response  was  equiprobable.  When  a 
fast  response  was  emitted  on  these  trials,  its  accuracy  depended  on  whether 
the  subject  happened  to  have  chosen  the  correct  response.  This  is 
consistent  with  our  interpretation  that  these  trials  are  fast  guesses. 
However,  the  data  presented  so  far  are  not  sufficient  to  conclude  that  fast 
guesses  occur  when  there  is  a  high  level  of  pre-stimulus  response 
activation.  Data  bearing  upon  this  issue  will  be  presented  later,  when  we 
consider  a  measure  of  individual  response  channel  activation  (the 
single-sided  readiness  potential).  The  presence  of  variable  levels  of 
response  activation  during  the  foreperiod  and  their  relationship  to  response 
accuracy  is  consistent  with  the  variable  baseline  hypothesis  of  response 
activation.  In  fact,  it  appears  that  the  relative  activation  of  response 
channels  in  the  foreperiod  is  related  to  the  accuracy  of  a  fast  guess. 
Post-stimulus  activation 

In  this  section,  we  show  how  measures  of  the  readiness  potential  are 
sensitive  to  the  modulation  of  response  channel  activation  by  stimulus 
evaluation  processes.  In  particular,  these  measures  support  the  view  that 
stimulus-related  response  activation  can  occur  prior  to  complete  stimulus 
evaluation.  — 

As  described  above,  we  computed  the  difference  between  the  electrical 
potential  recorded  from  the  two  lateral  electrodes  (C3‘  and  C4 ' )  on  each 
trial.  However,  we  now  focus  on  the  post-stimulus  period  from  array 
presentation  until  the  end  of  the  recording  epoch  1000  ms  later.  Trials 
were  sorted  as  a  function  of  array  compatibility  and  average  lateralization 
values  were  computed.  The  resulting  average  waveforms  are  shown  in  the 
middle  panel  of  Figure  4.  Note  that  these  waveforms  include  all  trials 
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-regardless  of  response  accuracy  or  latency. 


Insert  Figure  4  About  Here 


If  the  response  channels  are  continually  influenced  by  information 
coming  from  stimulus  evaluation  processes,  then  we  would  expect  that 
post-array  values  of  laterality  would  reveal  this  influence.  We  argued 
previously  on  the  basis  of  the  conditional  accuracy  functions  (reproduced  in 
the  upper  panel  of  Figure  4)  that  responses  with  a  latency  between  150  and 
249  ms  appeared  to  be  affected  more  by  the  noise  letters  than  by  the  target 
letter,  suggesting  an  influence  of  preliminary  phases  of  the  stimulus 
evaluation  process  on  response  channels.  In  fact,  the  average 
lateralization  waveform  for  incompatible  noise  exhibits  a  "dip"  towards 
incorrect  response  activation  between  150  and  250  ms  post-stimulus  (see 
middle  panel  of  Figure  4).  This  dip  corresponds  to  greater  negativity  at 
the  scalp  site  contralateral  to  the  incorrect  response.  Note  that  the 
waveforms  shown  in  the  middle  panel  of  Figure  4  correspond  very  closely  to 
the  conditional  accuracy  functions  shown  in  the  upper  panel.  The 
correlations  computed  between  the  average  laterality  (sampled  at  the 
beginning  of  each  response  latency  bin)  and  conditional  accuracy  values  for 
each  of  the  seven  bins  were  .93  for  compatible  and  .94  for  incompatible 
arrays  (p<.01).  When  these  correlations  were  coiiputed  on  an  individual 
subject  basis,  the  mean  value  for  compatible  arrays  was  .87  (.05  confidence 
interval  .79-. 92)  while  that  for  incompatible  arrays  was  .89  (.05  confidence 
interval  .82-. 94). 

These  data  point  to  a  remarkable  convergence  between  two  quite 
different  procedures  used  to  gain  insights  into  the  influence  of  the 
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evaluation  process  on  the  response  channels.  One  procedure  is  based  on 
measures  of  scalp  activity  and  the  other  on  the  latency  and  accuracy  of 
responses. 

While  both  procedures  converge  in  their  description  of  the  time-course 
of  evaluation  processes,  the  accuracy  of  this  description  is  compromised  by 
the  fact  that  both  procedures  are  based  on  data  aggregated  over  trials. 
Because  of  this,  it  is  not  clear  whether  the  functions  depicted  in  the  upper 
and  middle  panels  of  Figure  4  represent  the  nature  and  time  course  of  the 
evaluation  process  on  individual  trials.  The  figures  suggest  that 
information  about  the  array  is  accumulated  gradually  and  that  this 
information  is  continuously  available  to  response  systems.  However,  the 
gradually  increasing  continuous  function  that  characterizes  conditional 
accuracy  curves  can  be  generated  by  aggregating  trials  for  which  the  output 
of  stimulus  evaluation  is,  in  fact,  discrete  but  variable  in  latency  over 
trials.  Similarly,  the  description  provided  by  the  aggregated  laterality 
data  could  be  the  result  of  averaging  trials  for  which  a  discrete  change  in 
laterality  occurred  (in  the  direction  of  the  response  that  was  executed)  but 
for  which  the  change  varied  in  latency.  To  demonstrate  that  indeed  response 
activation  occurs  as  stimulus  evaluation  proceeds,  we  need  to  demonstrate 
the  influence  of  different  phases  of  the  stimulus  evaluation  process  on  the 
response  system  within  the  same  trial. 

To  this  end,  we  partitioned  the  laterality  data  as  a  function  of 
response  latency  and  accuracy.  Since  we  were  particularly  concerned  with 
finding  evidence  for  an  Influence  of  preliminary  evaluation  processes  on  the 
response  systems,  we  focussed  on  the  "dip"  in  laterality  seen  in  the 
waveform  for  Incompatible  arrays  in  the  middle  panel  of  Figure  4.  We 
argued  above  that  this  "dip"  represents  early  activation  of  the  Incorrect 
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response  due  to  the  influence  of  the  incompatible  noise  letters.  We 
reasoned  that  if  we  could  find  evidence  for  this  "dip"  on  trials  in  which 
the  first  peripheral  response  given  was  correct,  then  we  could  infer  that, 
indeed,  on  these  trials  at  least,  preliminary  evaluation  processes  could 
result  in  preliminary  response  activation.  Furthermore,  the  evidence  for  an 
influence  of  evaluation  processes  on  response  systems  should  be  most  evident 
for  relatively  slow  trials,  since  fast  trials  are  Influenced  by  pre-stimulus 
activation  effects,  as  we  saw  earlier.  Thus,  we  focussed  our  analysis  on 
relatively  slow,  correct  response  trials  only  (with  a  response  latency  of 
300-349  ms).  On  these  trials,  by  definition,  the  EMG  activity  was  first 
observed  on  the  correct  side.  The  lower  panel  of  Figure  4  gives  the 
laterality  values  for  these  trials. 

The  "dip"  can  be  clearly  seen  in  the  waveform  for  the  incompatible 
arrays  while  a  simultaneous  increase  in  laterality  toward  the  correct 
response  can  be  seen  in  the  waveform  for  the  compatible  arrays.  The 
difference  between  compatible  and  incompatible  arrays  in  the  area  of  the 
waveform  between  170  and  210  ms  after  the  array  presentation  was 
significant,  t(5)=2.67,  £<.05  (one-tailed),  as  was  the  difference  between 
the  same  area  measure  for  incompatible  arrays  and  zero,  t(5)=-2.78,  £<.05 
(one-tai led) . 

Thus,  the  measure  of  the  lateralized  readiness  potential  suggests  that 
processing  of  the  noise  letters  can  result  in  preliminary  incorrect  response 
activation,  even  though  the  correct  overt  response  is  ultimately  given 
presumably  as  a  result  of  a  later  occurring  analysis  of  the  target  letter. 
This  influence  is  evident  for  long  latency  responses  for  which  pre-stimulus 
activation  effects  are  minimal. 
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Response  execution 

Grice's  variable  criterion  hypothesis  (Grice  et  al.,  1982)  leads  to  the 
prediction  that  the  level  of  activation  of  the  response  channels  at  the 
moment  of  the  response  should  be  lower  for  fast  than  slow  responses,  while  a 
fixed  criterion  hypothesis  predicts  a  constant  level,  regardless  of  response 
latency.  Figure  5  shows  the  average  laterality  waveforms  from  100  ms  before 
the  warning  stimulus  until  the  time  of  the  response.  Waveforms  for  the 
compatible  incorrect  trials  are  not  included  because  of  insufficient  data 
(see  above). 


Insert  Figure  5  About  Here 


To  evaluate  the  merit  of  the  variable  criterion  hypothesis,  we  focus 
on  the  laterality  value  at  the  time  an  EMG  response  was  observed  (indicated 
by  vertical  lines  in  Figure  5). 5  An  analysis  of  variance  of  these  data 
revealed  no  significant  effects  of  compatibility  (£(1,  5)  =  3.02,  £>.05)  or 
response  latency  (both  the  main  effect  of  response  latency  bin  and  the 
interaction  of  response  latency  bin  by  response  accuracy  were  not 
significant,  £(3,15)  =  0.13,  and  £(3,15)  =  0.46,  respectively),  but  a 
significant  effect  of  accuracy,  £(l-r5)  =  133.39,  £  <.001.  In  fact,  the  mean 
laterality  values  were  -0.71  and  +0.65  microvolts  for  correct  and  incorrect 
responses  respectively.  These  data  indicate,  then,  that  the  absolute 
laterality  value  at  the  moment  the  EMG  response  is  triggered  is  constant 
regardless  of  response  latency  and  that,  as  expected,  the  direction  of 
laterality  is  related  to  response  accuracy. 

Individual  response  channel  activation 


The  measure  of  the  lateralization  of  the  readiness  potential  we  have 
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used  so  far  provides  only  an  index  of  the  relative  activation  of  the 
response  channels.  This  relative  measure  cannot  reveal  generalized 
facilitation  of  both  responses,  nor  can  it  distinguish  cases  in  which  one 
response  is  activated  from  cases  in  which  the  other  response  is  inhibited. 

A  related  problem  is  that,  conceptually,  it  is  reasonable  to  expect  the 
occurrence  of  a  fast  guess  to  be  dependent  on  the  absolute  level  of 
activation  of  a  response  channel.  Similarly,  the  emission  of  an  EMG 
response  may  depend  on  the  absolute  level  of  response  channel  activation. 

For  these  reasons,  it  would  be  useful  to  obtain  a  separate  measure  of 
the  activation  of  each  response  channel.  To  derive  such  a  measure  we  have 
to  solve  the  methodological  problem  of  isolating  ERP  components.  In  fact, 
the  potential  observed  at  each  of  the  lateral  scalp  electrodes  (C3 '  and  C4 ' ) 
is  given  by  the  sum  of  potentials  generated  locally  (i.e.,  the  single-side 
readiness  potential)  and  of  other  potentials  generated  in  other  regions  of 
the  brain  and  propagated  by  volume  conduction.  Therefore,  to  obtain  "pure" 
measures  of  the  potential  associated  with  the  activation  of  just  one 
response  channel,  we  need  to  remove  the  influence  of  the  other  potentials. 
Note  that  when  we  measure  the  lateralization  of  the  readiness  potential 
(i.e.,  the  difference  between  C3‘  and  C4 ' ) ,  the  influence  of  other  brain 
potentials,  not  associated  with  a  specific  motor  response,  Js  eliminated  by 
the  subtraction  procedure. 

We  approached  the  problem  of  isolating  the  single-sided  readiness 
potential  by  using  the  Vector  Filter  procedure  (Gratton,  Coles,  &  Donchin, 
1987).  A  description  of  the  procedure  and  its  derivation  is  given  in  the 
Appendix . 

Examples  of  the  waveforms  obtained  with  the  procedure  are  shown  in 
Figure  6.  The  waveforms  represent  the  estimated  activation  of  correct  and 
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incorrect  response  -channels  throughout  the  experimental  epoch  for  four 
response  latency  bins.  Average  waveforms  were  computed  separately  for  each 
compatibility  condition  and  for  correct  and  incorrect  trials.  The  8 
waveforms  for  the  compatible  correct  condition  shown  in  Figure  6  are 
representative  of  the  32  waveforms  available. 

Several  aspects  of  this  figure  are  noteworthy.  First,  during  the 
foreperiod,  there  is  greater  activation  of  the  correct  side  for  fast  than 
for  slow  trials.  To  determine  whether  the  absolute  amplitude  of  the 
readiness  potential  predicts  if  a  fast  guess  will  occur,  we  computed  the 
probability  that  a  fast  response _(i.e. ,  a  response  with  a  latency  shorter 
than  200  ms)  would  be  emitted  as  a  function  of  the  level  of  negativity  prior 
to  the  array  at  the  electrode  contralateral  to  the  responding  hand.  The 
level  of  negativity  prior  to  the  array  was  assessed  by  computing  the  average 
single-sided  readiness  potential  in  the  last  100  ms  of  the  foreperiod  at  the 
electrode  site  contralateral  to  the  correct  response  (when  the  probability 
of  emitting  a  correct  response  was  concerned)  and  at  the  electrode 
contralateral  to  the  incorrect  response  (when  the  probability  of  emitting  an 
incorrect  response  was  concerned).  We  sorted  the  trials  according  to  their 
prior  negativity  into  5  categories,  ordered  from  a  large  to  a  low  level  of 
negativity  prior  to  the  array.  The  prediction  that  a  fast  response  was  more 
likely  tou occur  on  trials  with  a  larger  negativity  prior  to  the  array  was  _ 
confirmed,  £(4,  20)  =  2.96,  £  <.05.  The  probability  ranged  from  .144  to 
.108  from  the  high  to  the  low  level  of  negativity.  A  post-hoc  analysis 
revealed  a  significant  linear  trend  In  probability  as  a  function  of  prior 
negativity,  F(l,  20)=7.36,  £<.02.  Thus,  the  probability  of  a  fast  response 
was  a  function  of  the  level  of  negativity  prior  to  the  array  at  the 
electrode  contralateral  to  the  responding  hand. 
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A  second  interesting  aspect  of  Figure  6  is  that  there  appear  to  be  a 
fixed  level  of  activation  of  the  response  channels  at  the  moment  the  EMG 
response  is  triggered.  Thus,  the  picture  of  response  activation  processes 
provided  by  Figure  6  (and  by  comparable  waveforms  for  the  other  conditions) 
confirms  the  results  reported  in  the  previous  sections  (see  Figure  5). 


Insert  Figure  6  About  Here 


The  derivation  of  single-sided  readiness  potential  measures  enabled  us 
to  focus  on  one  additional  question,  that  Is,  the  effect  of  mere  stimulus 
—  presentation  on  response  channels.  One  striking  common  aspect  of  all  the 
individual  response  channel  functions  (those  shown  in  Figure  6  and  24  other 
comparable  waveforms  from  incompatible  and  incorrect  trials)  is  that  both 
correct  and  incorrect  response  channels  show  an  increase  in  activation 
between  50  and  150  ms  after  the  array.  In  the  case  of  very  fast  response 
trials,  when  the  level  of  activation  of  a  response  channel  is  close  to  the 
criterion  due  to  priming  in  the  foreperiod,  this  increase  in  activation 
appears  to  raise  the  function  above  the  threshold  for  the  emission  of  an  EMG 
response. 

Discussion 

In  this  paper,  we  have  presented  an  analysis  of  response  channel 
activation  in  a  warned,  choice  RT  paradigm,  in  which  the  stimulus 
information  was  sometimes  conflicting.  We  proposed  that  activation  can  be 
modulated  by  several  mechanisms  including  those  related  to  pre-stimulus 
activation  or  bias  and  those  related  to  stimulus  evaluation  processes. 

Measures  of  the  lateral ized  readiness  potential  Indicate  that,  on  some 
trials,  subjects  appeared  to  select  and  activate  particular  responses  during 
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the  foreperiod  before  the  imperative  stimulus  (array)  was  presented.  In 
fact,  fast  responses  (with  a  latency  of  less  than  200  ms)  were  associated 
with  large  and  significant  lateralization  of  the  readiness  potential  during 
the  foreperiod.  If  the  lateralization  was  in  the  direction  associated  with 
the  correct  response,  the  response  tended  to  be  accurate.  If  it  was  in  the 
direction  associated  with  the  incorrect  response,  the  response  tended  to  be 
inaccurate.  For  these  fast  responses,  the  lateralization  of  the  readiness 
potential  was  similar  to  that  observed  in  simple  RT  conditions  when  the 
subject  knows  in  advance  the  hand  to  be  used  to  make  a  response  (cf.  Kutas  & 
Donchin,  1980).  Thus,  we  infer  that  a  fast  guess  was  preceded  by  selection 
and  activation  of  a  response  in  advance  of  stimulus  presentation. 
Furthermore,  when  pre-stimulus  selection  and  activation  occurred,  the  mere 
presentation  of  the  stimulus  appeared  to  trigger  a  response.  This  "mere 
presentation"  effect  may  be  due  to  a  "response  energizing"  process,  such  as 
that  proposed  by  Grice  et  al.  (1982)  and  Sanders  (1981).  Preliminary 
evidence  for  such  a  process  is  provided  by  the  analysis  of  single-sided 
readiness  potential  measures,  which  suggests  that,  regardless  of  response 
latency,  both  correct  and  Incorrect  response  channels  exhibited  increased 
activation  following  stimulus  presentation.  However,  this  increased 
activation  might  also  be  attributed  to  some  stimulus-processing  activity, 
not  completely  eliminated  by  our  procedure  for  isolating  the  single-sided 
readiness  potential  (see  Appendix).  * 

After  stimulus  presentation,  responses  appear  to  be  activated  as 
information  about  the  stimulus  becomes  available  -  unless,  of  course,  a  fast 
guess  is  made.  In  particular,  conditional  accuracy  functions  and 
lateralized  readiness  potential  values  suggest  a  tendency  for  incorrect 
response  activation  between  150  and  250  ms  after  the  presentation  of  an 
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incompatible  array.  Because  we  were  able  to  demonstrate  the  presence  of 
this  incorrect  response  activation  for  correct  trials  of  relatively  constant 
latency  (300-349  ms),  we  inferred  that,  at  least  on  some  trials,  stimulus 
evaluation  processes  may  affect  the  response  systems  before  the  evaluation 
processes  are  complete. 

Finally,  we  have  shown  that  responses  (EM6  activity)  occur  when 
response  activation  achieves  a  particular  fixed  level.  Analyses  of  both  the 
lateralized  and  single-sided  readiness  potential  at  the  time  of  response 
onset  indicated  that  the  amplitude  of  the  readiness  potential  was  fixed  for 
all  response  latencies. 

These  results  have  implications  for  various  issues  in  the  study  of 
human  information  processing.  First,  they  provide  support  for  a 
"variable-baseline/fixed-criterion"  hypothesis  of  response  activation,  in 
contrast  to  a  "f ixed-basel ine/variable-criterion"  hypothesis.  Specifically, 
we  have  demonstrated  that  a  measure  that  is  intimately  related  to  response 
processes  (the  readiness  potential)  varies  in  the  foreperiod  and  that  this 
variability  is  related  to  the  speed  and  accuracy  of  responses.  We  have  also 
shown  that  the  same  measure  appears  to  have  a  fixed  value  at  the  time  of  the 
response,  regardless  of  response  latency.  While  the  latter  observation 
suggests  that  a  fixed  level  of  activation  of  some  central  structure  - 
determines  whether  a  peripheral  response  will  be  activated,  it  is  of  course 
possible  that  variable  criteria  operate  at  earlier  phases  of  response 
activation,  although  our  measures  of  activation  provide  no  evidence  about 
the  operation  of  variable  criteria.  It  should  be  noted  that  the  portion  of 
ERP  waveform  that  is  associated  with  the  moment  at  which  the  EMG  response  is 
triggered  may  reflect  the  execution  of  the  response  (a  fixed  phenomenon) 
rather  than  the  preparation  of  the  response  (a  variable  phenomenon). 


Response  channels  and  psychophysiology 


Page  30 


However,  recent  neurophysiological  data-question  the  distinction  between 
response  execution  and  preparation.  In  fact,  Requin  (1985)  showed  that 
there  are  three  classes  of  neurons  in  the  motor  cortex  that  fire  during  a 
warned  RT  trial.  Some  fire  only  during  the  foreperiod,  others  fire  only 
during  the  response,  and  still  others  fire  during  both  the  foreperiod  and 
the  response.  While  our  scalp  recordings  cannot  distinguish  among  the 
activity  of  these  three  classes  of  neurons,  we  should  note  that  Requin 
(1985)  proposes  that  the  three  classes  operate  as  an  integrated  system.  We 
consider  this  neurophysiological  system  as  constituting  a  central  portion  of 
the  response  channel.  Thus,  response  preparation  and  response  execution  can 
be  represented  by  different  aspects  of  the  response  channel  activity. 

A  second  issue  addressed  by  the  present  experiment  concerns  the  nature 
of  the  stimulus  evaluation  process.  The  interference  determined  by 
irrelevant  noise  presented  visually  in  close  proximity  to  target  information 
has  been  studied  extensively  (for  a  review  see  Johnston  &  Dark,  1986).  In 
the  present  study,  conditional  accuracy  and  lateralization  data  converge  in 
suggesting  that  there  are  at  least  two  phases  in  stimulus  evaluation. 

During  the  first  phase  (between  150  and  250  ms)  the  correct  response  tends 
to  be  activated  when  the  noise  letters  are  compatible,  and  the  incorrect 
response  tends  to  be  activated-when  the  noise  letters  are  incompatible. 

Later  in  time,  the  correct  response  is  activated  regardless  of  the  nature  of 
the  noise  letters.  As  we  have  noted  previously  (Coles  et  al.,  1985),  the 
first  phase  appears  to  correspond  to  an  analysis  of  all  the  letters  or 
features  in  the  array  without  regard  to  their  location,  while  the  second 
phase  may  be  related  to  the  association  of  the  letters  with  their  location. 
This  two-phases  conception  of  stimulus  evaluation  processes  Is  consistent 
with  the  feature  integration  theory  of  Treisman  and  Gelade  (1980,  see  also 
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Treisman,  Sykes,  &  Gelade,  1977),  and  with  the  "zoom-lens"  analogy  proposed 
by  Eriksen  and  Yeh  (1985,  see  also  Naatanen,  1982).  Our  data  appear  to  be 
consistent  with  the  interpretation  that  location  plays  an  important  role  in 
visual  attention  (Johnston  &  Dark,  1986).  However,  since  we  did  not 
manipulate  information  about  location,  our  data  cannot  serve  to  determine 
whether  this  role  is  qualitatively  different  from  that  of  other  stimulus 
dimensions  (as  proposed  by  the  zoom-lens  analogy)  or  similar  to  that  of 
other  stimulus  dimensions  (as  proposed  by  the  feature-integration  theory). 

The  conditional  accuracy  functions  also  reveal  that  accuracy  is  between 
.90  and  .95  for  incompatible  arrays  even  at  long  response  latencies,  when 
accuracy  might  be  expected  to  approximate  unity.  This  suggests  that  the 
stimulus  evaluation  system  sometimes  fails  to  correctly  identify  the  target 
letter,  perhaps  because  its  operation  terminates  when  some  degree  of 
certainty  about  the  stimulus  is  reached,  or  because  its  performance  is 
sometimes  data-1 imited. 

A  third  issue  addressed  by  the  present  experiment  concerns  the 
possibility  that  stimulus  information  can  affect  response-related  processes 
before  the  evaluation  process  is  completed.  Our  measure  of  preliminary 
response  activation  (at  a  central  level)  showed  that  incorrect  responses  can 
be  partially  activated  on  the  same  trials  for  which  a  correct  overt  response 
is  given  later.  These  data  are  inconsistent  with  a  single-stage  decision 
model  that  places  the  decision  stage  at  an  earlitr  level  than  that 
manifested  by  the  readiness  potential  measures.  Furthermore,  our  previous 
findings  (Coles  et  al.,  1985)  are  inconsistent  with  a  model  that  places  a 
single-decision  stage  at  an  earlier  level  than  the  peripheral  response  (EMG 
or  squeeze).  In  the  earlier  study,  we  found  that  both  correct  and  incorrect 
peripheral  responses  could  be  activated  on  the  same  trial.  Taken  together, 
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these  data  argue_against  single-decision  stage  models  (e.g.  Sternberg,  1969) 
because  information  is  apparently  transmitted  from  stimulus  evaluation  to 
response  systems  at  at  least  two  moments  in  time.  This  is  consistent  with 
continuous  flow  (Eriksen  &  Schultz,  1979;  Grice  et  al.,  1982)  or 
multiple-decision-stage  models  (Miller,  1982). 

In  conclusion,  we  have  illustrated  how  the  concept  of  response  channels 
and  the  measurement  of  their  activation  can  provide  insights  into  the 
mechanisms  involved  in  a  warned,  choice  RT  task.  Our  data  suggest  that 
response  channels  are  continuously  active,  and  that  when  this  activity 
crosses  a  threshold,  a  peripheral  response  is  emitted.  Reaction  times 
measure  the  latency  at  which  the  threshold  is  crossed  and  the  overt  response 
is  emitted.  However,  they  provide  little  information  about  the  behavior  of 
the  response  channel  activation  function  in  the  period  preceding  the  moment 
at  which  the  overt  response  is  emitted.  In  our  previous  work,  we  used 
measures  of  the  EMG  to  provide  a  second  point  in  the  description  of  the 
response  channel  activation  function  (Coles  et  al.,  1985;  Eriksen  et  al., 
1985).  In  the  present  experiment,  we  extended  our  measurement  repertoire  to 
include  a  continuous  measure  (the  lateralization  of  the  readiness  potential) 
and  we  have  illustrated  its  power  in  providing  insights  into  various  aspects 
of  the  human  information  processing  system. 
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Footnotes 

1.  Another  negative  component,  labelled  Contingent  Negative  Variation 
(CNV ,  Walter,  Cooper,  Aldridge,  McCallum,  &  Winter,  1964)  can  be  observed  in 
the  foreperiod  of  warned  RT  tasks  (see,  for  instance.  Coles  et  al.,  1985). 

The  distinction  between  the  readiness  potential  and  the  CNV  has  recently 
been  debated  (see  Rohrbaugh  &  Gail  lard,  1983).  In  particular,  the 
possibility  of  a  CNV  in  the  absence  of  an  overt  response  has  been  discussed. 
The  arguments  related  to  this  debate  are  beyond  the  scope  of  this  paper.  We 
note  that,  in  general,  the  literature  about  CNV  has  focussed  on  its 
relationship  to  "attentional "  or  "alertness"  states,  while  that  on  the 
readiness  potential  has  focussed  on  its  relationship  with  motor  responses. 
Since  we  will  consider  the  scalp  negativity  preceding  the  stimulus  in  terms 
of  its  relationship  with  the  motor  response,  we  will  use  the  label 
"readiness  potential"  to  refer  to  it. 

2.  The  simple  RT  condition  was  used  to  determine  whether  in  fact  a 
scalp  lateralization  of  the  readiness  potential  could  be  observed  under 
conditions  of  extreme  response  bias.  Since  such  a  result  was  in  fact 
obtained,  we  will  not  discuss  this  condition  further. 

3.  In  our  previous  study  (Coles  et  al.,  1985),  we  analyzed  both  EMG  and 
squeeze  onset  latencies.  Squeeze  and  EMG  latency  data  from  the  present 
study  replicated  those  obtained  previously  for  the  "random-warned" 
condition.  In  particular,  for  incompatible  arrays  relative  to  compatible 
arrays,  we  observed  (a)  longer  RTs  (by  41  ms),  F ( 1 ,5)=59. 16,  £<.001,  (b) 
longer  intervals  between  EMG  and  squeeze  onsets  (by  6  ms),  F(l,5)=7.66, 
£<.05,  and  (c)  more  trials  for  which  both  squeeze  responses  were  initiated, 
F( 3, 15)  =  24 . 30,  £<.001.  Furthermore,  on  trials  where  both  squeeze  responses 
were  initiated,  the  EMG  to  squeeze  interval  was  prolonged  by  18  ms,  F(2, 
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10 )  =  32 . 52 ,  £<.001.  Thus,  data  from  the  present  experiment  confirm  that 
response  competition  is  a  contributing  factor  to  the  differences  in  mean  RTs 
between  compatible  and  incompatible  noise  stimuli. 

4.  Estimates  of  the  magnitude  of  the  lateralized  readiness  potential  on 
each  single  trial  were  obtained  at  each  electrode  by  computing  the  average 
EEG  activity  of  the  100  ms  preceding  the  appearance  of  the  stimulus  array. 
Single  trials  were  then  classified  into  one  of  three  categories 
(IPSILATERAL,  CONTRALATERAL,  and  NON-LATERALIZED) ,  by  comparing  the 
readiness  potential  at  C 3 1  and  C4‘ .  The  voltage  difference  between  these 
two  electrodes  was  transformed  into  standard  scores  (for  each  subject).  If 
the  standardized  difference  value  between  the  two  electrodes  was  between 
+0.5  and  -0.5,  the  trial  was  said  to  be  "non-lateral ized."  If  the 
difference  was  larger  than  this  criterion,  the  trial  was  said  to  be 

"ipsi lateral "  if  there  was  more  negativity  at  the  electrode  ipsi lateral  to 
the  hand  corresponding  to  the  correct  response,  and  "contralateral"  if  there 
was  more  negativity  at  the  electrode  contralateral  to  the  hand  corresponding 
to  the  correct  response. 

5.  To  account  for  the  transmission  delay  between  the  cortex  and  the 
muscle,  we  actually  sampled  the  value  of  the  lateralized  readiness  potential 
at  the  beqinni ng  of  each  EMG  onset  latency  bin  --  that  is,  on  the  average, 

25  ms  before  the  onset  of  EMG  activity. 
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Appendix 

The  potential  recorded  at  each  of  the  two  lateral  electrodes  (C31  and 
C4 ' )  can  be  considered  as  being  given  by  the  sum  of  the  readiness  potential 
generated  locally  (that  is,  in  the  region  of  the  brain  close  to  the 
electrode)  and  of  scaled  fractions  of  potentials  generated  in  other  parts  of 
the  brain  and  that  are  propagated  by  volume  conduction.  The  other 
potentials  may  include  the  readiness  potential  generated  underneath  the 
contralateral  electrode,  as  well  as  other  ERP  components.  This  can  be 
expressed  by  the  following  formulae: 

(la)  V (C 3 ' )  =  al*V(C3't)  +  a2*V(C4't)  +  bl*V(ERPl)  +  ...  +  bn*V(ERPn) 

(lb)  V(C4')  =  al*V(C4't)  +  a2*V(C3‘t)  +  bl*V(ERPl)  ♦  ...  +  bn*V(ERPn) 

where: 

V(C3')  and  V(C4')  are  the  potentials  recorded  at  C3 '  and  C4‘ ; 

V(C3't)  and  V(C4't)  are  the  "true"  readiness  potentials  generated 
underneath  C3'  and  C4' ; 

V(ERP)  are  potentials  associated  with  other  ERP  components  that  may  affect 
the  recordings; 

al,  a2,  bl,  and  bn,  are  scaling  factors. 

By  subtracting  (lb)-from  (la)  we  get  (2): 

(2)  V ( C3 ' ) -V (C4 ' )  =  (al-a2)  *  ( V(C3* t)-V(C4' t) | 

Thus,  apart  from  a  scaling  factor,  the  difference  between  the  potentials 
observed  at  the  lateral  electrodes  is  equal  to  the  difference  between  the 
"true"  readiness  potentials. 

While  the  derivation  of  the  differential  measure  is  quite 
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straightforward,  the- separate  measurement  of  the  readiness  potential  on  each 
side  is  more  problematic.  In  fact,  (la)  and  (lb)  can  be  transformed  in  (3a) 
and  (3b): 

(3a)  al*V(C3 ' t)  =  V(C3')  -  a2*V(C4't)  -  bl*V(ERPl)  -  ...  -  bn*V(ERPn) 

(3b)  al*V(C3't)  =  V(C3‘)  -  a2*V(C4't)  -  bl*V(ERPl)  -  ...  -  bn*V(ERPn) 

From  (3a)  and  (3b)  it  is  evident  that,  in  order  to  obtain  the  value  of  the 
readiness  potential  on  each  side,  the  contribution  of  components  with  a 
significant  influence  on  C3‘  and  C4‘  must  be  estimated  and  subtracted  from 
the  voltages  recorded  at  C3 '  and  C4 ' .  Gratton,  Coles,  and  Donchin  (1987) 
have  developed  a  procedure  (Vector  Filter)  that  allows  the  investigator  to 
separate  the  contribution  of  ERP  components  characterized  by  different 
distribution  over  the  scalp  electrodes.  This  technique  is  based  on  the 
development  of  a  series  of  "spatial  filters,"  each  specific  for  a  different 
component.  These  filters  are  given  by  a  series  of  weights,  one  for  each 
electrode.  The  filtered  data  are  represented  by  linear  combinations  of  the 
values  observed  at  each  electrode.  If  sets  of  weights  are  chosen  to  be 
orthogonal,  the  corresponding  filtered  components  will  be  orthogonal 
(although  not  necessarily  uncorrelated).  The  Vector  Filter  procedure  can 
result  in  a  reduction  of  the  contribution  of  overlapping  components, 
although_it  cannot  guarantee  their  complete  elimination.  Examples  of  the 
gains  in  component  identification  and  measurement  obtained  by  applying  the 
Vector  Filter  procedure  are  given  in  Gratton,  Kramer,  Coles,  and  Donchin 
(1987)  and  in  Fabiani,  Gratton,  Karis,  and  Donchin  (in  press). 

In  the  analysis  of  the  data  of  the  present  experiment  we  chose  sets  of 
weights  according  to  the  following  criteria.  First,  they  should  suppress 
the  contamination  due  to  the  contralateral  readiness  potential.  For  this 
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reason,  the  two  sets  of  weights  used  for  estimating  independently  the 
readiness  potential  on  each  side  were  selected  so  as  to  be  orthogonal  (i.e., 
their  cross-product  was  equal  to  0).  Second,  they  should  be  such  as  to 
suppress  the  influence  of  other  ERP  components.  In  particular,  we  filtered 
out  the  contribution  of  components  with  an  exclusively  midline  distribution. 
This  was  accomplished  by  giving  equal  weights  to  the  midline  electrodes  (Fz, 
Cz,  and  Pz).  Third,  to  eliminate  general  components  (with  an  equivalent 
influence  at  all  sites),  the  sum  of  each  set  of  weights  was  equal  to  0. 

The  Vector  Filters  used  were  the  following: 

(4a)  C3't  =  0.888*C3 1  -  0. 114*C4 '  -  0.258*Fz  -  0.258*Cz  -  0.258*Pz 

(4b)  C4 ' t  =  0.888*C4‘  -  0.114*C3'  -  0.258*Fz  -  0.258*Cz  -  0.258*Pz 

Note  that  the  sum  of  squares  of  each  set  of  weights  is  equal  to  1,  so 

that  the  Vector  Filter  amounts  to  a  rotation  of  axes  in  the  space  defined  by 

the  electrode  locations. 

To  provide  some  indication  of  the  validity  of  this  procedure,  we 
computed  the  average  correlation  across  trials,  over  the  six  subjects, 
between  C3 '  and  C4 '  (in  the  last  100  ms  before  array  presentation)  before 
and  after  applying  Vector  Filter.  As  can  be  seen  by  comparing  (3a)  and 
(3b),  the  "undesired"  components  (i.e.,  the  contralateral  readiness 
potential,  as  well  as  the  other  ERP  components)  will  produce  similar  effects 
on  C3 '  and  C4‘,  and  will  therefore  tend  to  increase  the  correlation  between 
the  potential  recorded  at  these  two  electrodes.  The  average  correlation  was 
.83  +.03  before  correction,  and  .00  +.08  after  correction.  Note  that  the 
latter  correlation  could  have  been  different  from  0  even  though  orthogonal 
sets  of  weights  were  used.  This  would  occur  if  the  procedure  under-  or 
over-estimated  the  effects  of  volume  conduction,  or  if  there  was  some 
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"structural"  or  "functional"  association  between  the  brain  electrical 
activity  generated  beneath  the  two  lateral  electrodes.  The  latter  might 
occur  under  conditions  of  "response  competition"  or  "response  energizing. 
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Figure  Legends 

Figure  1.  Conditional  accuracy  functions  (upper  panels)  and  proportion 
of  trials  for  each  response  latency  bin  (lower  panels)  for  compatible 
(solid)  and  incompatible  (dashed)  trials.  Values  on  the  ordinate  correspond 
to  the  midpoint  of  each  response  latency  bin.  Data  for  trials  classified  on 
the  basis  of  criterion  squeeze  response  are  shown  in  the  left  panels,  and 
for  trials  classified  on  the  basis  of  the  EMG  onset  in  the  right  panels. 

For  the  conditional  accuracy  functions  based  on  EMG  onset  latency,  standard 
errors  are  indicated  by  the  vertical  bars.  For  criterion  squeeze  response 
functions,  only  three  subjects  had  trials  in  the  100-149  ms  latency  bin. 

Figure  2.  Grand-average  ERP  waveforms  depicting  the  voltage  difference 
between  the  scalp  electrodes  contralateral  and  ipsi lateral  to  the  correct 
response  for  correct  (solid)  and  incorrect  (dashed)  response  trials,  from 
the  incompatible  condition.  The  waveforms  in  the  upper  panel  are  averages 
for  trials  with  response  latencies  between  300  and  349  ms,  the  waveforms  in 
the  lower  panel  refer  to  trials  with  response  latencies  between  150  and  199 
ms.  Upward  deflections  indicate  lateralization  (larger  negativity)  toward 
the  scalp  electrode  contralateral  to  the  correct  response,  downward 
deflections  indicate  lateralization  toward  the  scalp  electrode  ipsilateral 
to  the  correct  response  (and  contralateral  to  the  incorrect  response). 

Figure  3.  Conditional  accuracy  functions,  averaged  over  compatible  and 
incompatible  conditions,  as  a  function  of  the  lateralization  of  the 
readiness  potential  during  the  last  100  ms  of  the  foreperiod. 

Figure  4.  Upper  panel.  Conditional  accuracy  functions  for  compatible 
and  incompatible  trials.  Middle  panel.  ERP  waveforms  of  the  lateralized 
readiness  potential  following  array  presentation,  averaged  separately  for 
all  compatible  and  for  all  incompatible  trials.  Lower  panel.  ERP  waveforms 
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of  the  lateralized  readiness  potential  following  array  presentation,  for 
compatible  and  incompatible  correct  trials  with  a  response  latency  between 
300  and  349  ms. 

Figure  5.  ERP  waveforms  of  the  lateralized  readiness  potential  for  the 
period  beginning  100  ms  before  the  warning  tone  and  ending  at  the  time  of 
the  peripheral  response  to  the  array.  Separate  waveforms  are  shown  for 
compatible  correct  (upper  panel),  incompatible  correct  (middle  panel),  and 
incompatible  incorrect  trials  (lower  panel),  for  four  response  latency  bins. 
The  solid  vertical  lines  in  each  panel  indicate  the  response  latency  for 
each  bin.  The  upper  and  lower  horizontal  lines  in  each  panel  indicate  the 
inferred  thresholds  for  correct  and  incorrect  muscle  response  emission. 

Figure  6.  ERP  waveforms  representing  the  correct  (solid)  and  incorrect 
(dashed)  single-side  readiness  potentials  for  compatible  correct  trials. 
Separate  average  waveforms  are  shown  for  four  response  latency  bins.  The 
dashed  vertical  lines  after  array  presentation  indicate  the  response  latency 
for  each  bin.  The  upper  horizontal  line  in  each  panel  indicate  the  inferred 
threshold  for  muscle  response  emission. 
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(imp-la#  between  the  prrsentdtion  of  the  information  and  the  subsequent  control  instructed  to  tixdte  their  eyes  on  the  center  of  a  CRT  while  locussing  their 
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l'JHO)  Furthermore,  the  FRPs  h.ive  demonstrated  .1  sensitivity  to  the  direction  ol  prim.iry  t.i-'k  that  is  to  be  performed  as  well  as  possible  and  a  secondarv  t.i 

attention  independent  ot  eye  movements.  (J)n  the  basis  of  the  ERP  results,  it  wbi<  h  is  to  be  performed  to  the  extent  that  it  does  not  interfere  with  the 

appears  that  the  processing  of  stimulus  features  such  as  color,  si/eand  brightness  pertormunr  e  of  the  primary  task  It  is  assumed  that  the  demands  imposed  upon 

are  hierarchically  rlependent  upon  prior  selection  tor  location.  tt1(.  sub|e(t  bv  the  primary  task  will  he  relief ted  in  the  pertormam  e  on  the 

(he  “-tudies  described  thus  far  have  investigated  the  sensitivity  of  EKPs  to  secondary  task  Increasing  primary  task  difficulty  presumably  leads  to  an  in 
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HUMAN  FACTORS,  1987,  29(2),  145-160 

A  Psychophysiological  Assessment  of  Operator 
Workload  During  Simulated  Flight  Missions 

ARTHUR  F  KRAMER.1  ERIK  J  SIREVAAG,  and  ROLF  BRAUNE,  Institute  of  Aviation, 
Aviation  Research  Laboratory.  University  of  Illinois,  Champaign,  Illinois 


Pre\  tons  research  has  indicated  that  components  of  the  event-related  potential  (ERPf  may  be 
used  to  quantify  the  resource  requirements  of  complex  cognitive  tasks.  The  present  study  was 
designed  'u  explore  the  degree  to  which  these  results  could  ’  generalized  to  complex,  real- 
world  tasks.  The  study  also  examined  the  relations  among  performance-based,  subjective, 
and  psychophysiological  measures  of  operator  workload.  Seven  male  volunteers,  enrolled  in 
an  instrument  flight  rule  IlFR)  aviation  course  at  the  University  of  Illinois,  participated  in 
the  study.  The  student  pilots  flew  a  series  of  I FR  flight  missions  in  a  single-engine,  fixed- 
based  simulator.  In  dual-task  conditions  subjects  were  also  required  to  discriminate  be¬ 
tween  two  tunes  differing  in  frequency  and  to  make  an  occasional  overt  response.  ERPs 
time-lucked  to  the  tones,  subjective  effort  ratings,  and  overt  performance  measures  were 
collected  during  two  separate  45-min  flights  differing  in  difficulty.  The  difficult  flight  was 
associated  with  high  subjective  effort  ratings,  as  well  as  increased  deviations  from  the  com¬ 
mand  altitude,  heading,  and  glideslope.  The  P300  component  of  the  ERP  discriminated 
among  levels  of  task  difficulty,  decreasing  in  amplitude  with  increased  task  demands. 
Within-flight  demands  were  examined  by  dividing  each  flight  into  four  segments:  takeoff, 
straight  and  level  flight,  holding  patterns,  and  landings.  The  amplitude  of  the  P300  was 
negatively  correlated  with  deviations  from  command  headings  across  the  flight  segments.  In 
sum,  the  findings  provide  preliminary  evidence  for  the  assertion  that  ERP  components  can 
be  employed  as  metrics  of  resource  allocation  in  complex,  real-world  environments. 


INTRODUCTION 

The  importance  ot  the  explication  of 
mental  workload  in  operational  environ¬ 
ments  has  been  underscored  in  recent  years 
by  a  number  of  conferences  convened  to  ex¬ 
amine  the  topic  as  well  as  by  several  method¬ 
ological  and  theoretical  reviews  of  the  litera¬ 
ture  (Fra/ier  and  Crombie,  1982;  Gopher  and 

1  Requests  ten  reprints  should  be  sent  to  Arthur  F 
Kramer.  Department  ol  Psychology.  University  ol  Illinois 
at  Uibana-Champaign,  603  E  Daniel  St  ,  Champaign,  !L 
6IH20 


Donchin,  1986;  Moray,  1979;  O’Donnell  and 
Eggemeier,  1986,  Williges  and  Wierwille, 
1979).  Although  mental  workload  is  acknowl¬ 
edged  to  be  a  problem  in  many  manual  con¬ 
trol  and  supervisory  tasks,  there  has  been  a 
failure  to  reach  a  consensus  on  its  definition 
or  measurement.  This,  in  part,  is  due  to  the 
multidimensional  nature  of  the  concept  (Eg¬ 
gemeier,  1980;  Johannsen,  Moray,  Pew,  Ras¬ 
mussen,  Sanders,  and  Wickens,  1979). 

One  set  of  models  that  appear  to  offer  a 
useful  framework  in  which  to  conceptualize 
mental  workload  are  the  multiple-resource 


1  1 the  Human  Faclurs  Society,  Inc  All  rights  reserved. 
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models  of  uttentional  allocation  (Freidman 
and  Poison,  1981;  Navon  and  Gopher,  1979; 
Wickens,  1980,  1984).  In  these  models, 
human  capacity  is  represented  by  a  number 
of  finite  pools  of  resources  available  for  time¬ 
sharing  among  concurrently  performed 
tasks.  The  models  predict  that  tasks  that  re¬ 
quire  the  same  types  of  processing  resources 
will  be  more  poorly  time-shared  than  will 
tasks  that  require  different  resources. 
Wickens  (1980)  has  proposed  that  resources 
may  be  defined  by  three  dichotomous  dimen¬ 
sions;  stages  of  processing  (perceptual/cogni¬ 
tive  and  response),  codes  ot  processing 
(verbal  and  spatial),  and  modalities  of  pro¬ 
cessing  (auditory  and  visual).  Within  such  a 
framework,  mental  workload  can  be  described 
as  the  cost  ol  performing  one  task  in  terms  of 
a  reduction  in  the  capacity  to  perform  addi¬ 
tional  tasks,  given  that  the  two  tasks  overlap 
in  their  resource  demands. 

The  research  presented  here  derives  from 
an  extensive  series  of  investigations  that  have 
demonstrated  the  utility  ol  event-related 
brain  potentials  (ERPs)  in  the  assessment  ol 
residual  capacity  during  the  acquisition  and 
performance  of  a  variety  ot  perceptual-motor 
tasks  (Donchin,  Kramer,  and  Wickens,  1986; 
Kramer,  1987).  The  focus  ol  the  present  study 
was  to  employ  ERPs  in  conjunction  with 
measures  of  overt  performance  and  subjec¬ 
tive  indices  of  mental  workload  in  order  to 
monitor  changes  in  resource  demands  that 
occur  during  a  complex  real-world  task.  The 
task  involved  "flying"  an  instrument  flight 
rule  (IFR)  plan  in  a  single-engine  aircraft 
simulator. 

The  ERP  is  a  transient  series  of  voltage 
oscillations  in  the  brain  that  can  be  recorded 
from  the  scalp  in  response  to  the  occurrence 
of  a  discrete  event  (Donchin,  1975;  Regan, 
1972).  This  temporal  relationship  between 
the  ERP  and  the  eliciting  stimulus  or  re¬ 
sponse  is  what  differentiates  ERPs  from  the 
ongoing  electroencephalographic  activity. 


Thus,  the  electroencephalograph  (EEG)  pro¬ 
vides  a  measure  of  the  tonic  state  of  the  or¬ 
ganism,  whereas  ERPs  reflect  phasic  changes 
related  to  the  processing  of  specific  events. 

It  is  important  to  recognize  the  componen- 
tial  nature  of  the  ERP.  ERPs  have  generally 
been  viewed  as  a  sequence  of  separate  but 
sometimes  temporally  overlapping  compo¬ 
nents  that  are  influenced  by  a  combination  of 
the  physical  parameters  of  the  stimuli  and 
such  psychological  constructs  as  expectancy, 
task  relevance,  attention,  and  memory 
(Kramer,  1985).  Components  are  typically  la¬ 
beled  with  an  N  or  a  P,  denoting  negative  or 
positive  polarity,  and  with  a  number  indi¬ 
cating  their  minimal  latency  measured  from 
the  onset  of  an  eliciting  event  (e.g.,  N100  is  a 
negative-going  component  that  occurs  at 
least  100  ms  after  a  stimulus). 

ERP  components  may  be  categorized  along 
a  continuum  from  exogenous  to  endogenous. 
The  exogenous  components  represent  an 
obligatory  response  of  the  brain  to  the  pre¬ 
sentation  of  a  stimulus.  These  components 
are  usually  associated  w'ith  specific  sensory 
systems,  occur  within  200  ms  of  a  stimulus, 
and  are  primarily  sensitive  to  the  physical  at¬ 
tributes  of  stimuli.  For  example,  exogenous 
visual  potentials  are  influenced  by  the  inten¬ 
sity,  frequency,  hue,  patterning,  and  location 
of  the  stimulus  in  the  visual  field.  The  exoge¬ 
nous  components  have  been  successfully 
used  in  clinical  settings  to  monitor  the  func¬ 
tional  integrity  of  the  nervous  system  during 
surgical  procedures,  to  assess  changes  in  the 
nervous  system  as  a  result  of  maturation  and 
aging,  and  to  help  diagnose  various  types  of 
neuropathology  including  tumors,  lesions, 
and  demyelinating  diseases,  such  as  multiple 
sclerosis  (Starr,  1978;  Stockard,  Stockard, 
and  Sharbrough,  1979). 

The  endogenous  components,  on  the  other 
hand,  occur  somewhat  later  than  the  exoge¬ 
nous  components  and  are  not  very  sensitive 
to  changes  in  the  physical  parameters  of 
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stimuli,  especial  I  v  when  these  changes  are 
not  relevant  to  the  task.  Instead,  these  com¬ 
ponents  are  primarily  influenced  by  the  pro¬ 
cessing  demands  of  the  task  imposed  on  the 
subject.  In  tact,  endogenous  components  can 
even  be  elicited  by  the  absence  ot  a  stimulus 
if  this  "event"  is  relevant  to  the  subject's 
task.  The  strategies,  expectancies,  intentions, 
and  decisions  of  the  subject,  along  with  task 
parameters  and  instructions,  account  for  the 
majority  of  the  variance  in  the  endogenous 
components.  One  of  the  dependent  variables 
in  the  present  study,  the  P300,  is  a  typical  ex¬ 
ample  ol  an  endogenous  component.  (For  a 
comprehensive  review  of  the  P300,  see  Prit¬ 
chard,  1981 .) 

One  might  ask  why  ERPs  should  be  used  to 
monitor  changes  in  resource  demands,  given 
that  several  technically  simpler  approaches 
to  the  assessment  of  skill  acquisition  and 
mental  workload  have  already  been  imple¬ 
mented.  Although  numerous  performance- 
based  measures  of  mental  workload  exist, 
they  suffer  lrom  several  drawbacks.  First, 
some  of  the  measurement  techniques  require 
subjects  to  perform  a  secondary  task,  which 
frequently  inlertercs  with  the  performance  of 
the  task  of  interest  (Knowles,  1963;  Rolle, 
1971;  Wickcns,  1979).  This  is  clearly  unac¬ 
ceptable  in  an  operational  environment  in 
which  the  safety  ot  the  operator  must  be  as¬ 
sured.  Even  in  the  laboratory  setting  it  is  dit- 
t i cult  to  determine  which  of  the  two  tasks 
generated  an  observed  performance  decre¬ 
ment.  since  performance  on  the  two  tasks  is 
easily  confounded.  Second,  performance- 
based  measures  of  mental  workload  provide 
an  output  measure  ol  the  operator’s  informa¬ 
tion-processing  activities  (e  g.,  RT,  accuracy). 
Thus,  at  best,  performance  measures  provide 
only  an  indirect  index  of  cognitive  function. 
Third,  performance  measures  do  not  always 
correlate  highly  with  the  actual  workload  of 
the  tasks  (Brown,  1978;  Dornic,  1980;  Ogden, 
Lev  ine,  and  Eisner,  1979). 


For  example,  imagine  a  situation  in  which 
two  pilots  perform  a  series  of  difficult  flight 
maneuvers.  Under  normal  conditions,  both 
pilots  execute  the  maneuvers  with  a  high 
level  of  proficiency,  and  their  flight  perfor¬ 
mance  is  indistinguishable.  Is  this  to  say  that 
their  workload  is  also  equivalent?  Now 
imagine  the  same  two  pilots  flying  identical 
maneuvers  while  concurrently  attempting  to 
diagnose  the  cause  of  an  intermittent  engine 
problem.  Although  their  performance  is 
equivalent  under  normal  conditions,  one 
pilot  may  cope  adequately  with  the  addi¬ 
tional  demands,  whereas  the  second  pilot's 
flight  performance  may  deteriorate  as  he  or 
she  troubleshoots  the  abnormality.  This  pat¬ 
tern  of  results  would  suggest  that  the  second 
pilot  was  operating  under  higher  levels  of 
workload  even  during  the  normal  flight  con¬ 
dition. 

Several  recent  studies  have  illustrated  the 
usefulness  of  the  ERP,  and  more  specifically 
the  P300  component,  as  an  index  of  pro¬ 
cessing  resources  (Horst,  Munson,  and 
Ruchkin,  1984;  Isreal,  Chesney,  Wickens, 
and  Donchin,  1980;  Kramer,  Wickens,  and 
Donchin,  1983,  1985;  Natani  and  Gomer, 
1981;  Strayer  and  Kramer,  1986;  Wickens, 
Kramer,  Vanasse,  and  Donchin,  1983).  The 
general  paradigm  employed  in  these  studies 
requires  subjects  to  perform  two  tasks  con¬ 
currently.  One  task  is  designated  as  primary 
and  the  other  task  as  secondary.  Subjects  are 
instructed  to  maximize  their  performance  on 
the  primary  task  and  devote  any  additional 
resources  to  the  performance  of  the  sec¬ 
ondary  task. 

Primary  tasks  have  included  system  moni¬ 
toring,  decision  making,  and  manual  control. 
Secondary  tasks  have  required  subjects  to 
discriminate  between  tones  of  different  fre¬ 
quencies  or  lights  of  different  intensities.  In 
general,  the  response  demands  of  the  sec¬ 
ondary  probe  tasks  have  been  minimal,  re¬ 
quiring  subjects  either  to  covertly  count  the 
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total  number  of  one  type  ot  event  or  to  re¬ 
spond  to  an  occasional  target  probe. 

ERPs  are  elicited  bv  events  either  in  one  or 
in  both  of  the  tasks.  Increases  in  the  percep¬ 
tual/cognitive  difficulty  of  the  primary  task 
result  in  a  decrease  in  the  amplitude  of  the 
P300s  elicited  by  the  secondary  task.  Con¬ 
versely,  P300s  elicited  by  discrete  events  em¬ 
bedded  within  the  primary  task  increase  in 
amplitude  with  increases  in  primary  task  dif¬ 
ficulty.  Furthermore,  changes  in  response-re¬ 
lated  demands  of  a  task  have  no  influence  on 
the  P300  (Isreal  et  al„  1980). 

The  reciprocal  relationship  between  P300s 
elicited  by  primary  and  secondary  task 
stimuli  is  consistent  with  the  resource  trade¬ 
offs  presumed  to  underlie  dual-task  perfor¬ 
mance  decrements  (Kahneman,  1973;  Navon 
and  Gopher,  1979;  Sanders,  1979;  Wickens, 
1980).  That  is,  resource  models  predict  that 
as  the  difficulty  of  one  task  is  increased,  ad¬ 
ditional  resources  are  reallocated  to  that  task 
in  order  to  maintain  performance,  thereby 
depleting  the  supply  of  resources  that  could 
have  been  used  in  the  processing  oi  other 
tasks.  Thus,  the  P300  appears  to  provide  a 
measure  of  resource  trade-offs  that  can  only 
be  inferred  from  more  traditional  perfor¬ 
mance  measures.  Furthermore,  P300s  elic¬ 
ited  by  secondary  task  events  are  selectively 
sensitive  to  the  perceptual/cognitive  de¬ 
mands  imposed  on  the  operator.  This  selec¬ 
tive  sensitivity  may  be  especially  useful  in 
decomposing  the  changing  processing  re¬ 
quirements  of  complex  tasks  (Kramer  et  al., 
1983) 

The  goal  of  the  present  experiment  was  to 
augment  the  conclusions  drawn  from  the 
studies  cited  above  by  demonstrating  that 
the  dual-task  ERP  paradigm  could  be  em¬ 
ployed  in  a  complex  real-world  situation  to 
provide  information  concerning  mental 
workload  and  residual  capacity.  Student 
pilots  performed  a  series  of  dual-task  (light 
missions.  In  each  case,  the  primary  task  con¬ 


sisted  of  performing  a  specified  flight  sce¬ 
nario  under  IFR  flight  conditions.  The  diffi¬ 
culty  of  the  primary  task  was  varied  in  two 
ways.  Between-mission  difficulty  was  manipu¬ 
lated  by  varying  the  direction  and  speed  of 
wind  conditions,  the  severity  of  turbulence, 
and  the  probability  of  a  subsystem  failure 
during  a  critical  portion  of  the  mission.  A 
second  way  that  difficulty  was  manipulated 
might  be  labeled  within-mission  difficulty.  In 
this  case,  we  capitalized  on  the  different 
levels  of  processing  demand  inherent  in  the 
(light  task  (i.e.,  straight  and  level  versus  ap¬ 
proach  to  landing). 

The  secondary  task  consisted  of  a  concur¬ 
rently  performed  go/no-go  auditory  discrimi¬ 
nation  task,  in  which  subjects  pressed  a 
button  in  response  to  the  presentation  of  one 
of  two  tones.  ERPs  associated  with  the  sec¬ 
ondary  task  tones,  overt  performance  mea¬ 
sures  from  the  flight  task  and  discrimination 
task,  and  subjective  indices  of  task  difficulty 
were  examined  to  assess  the  extent  to  which 
the  manipulations  of  primary  task  difficulty 
modulated  the  mental  workload  associated 
with  the  flight  task. 

METHOD 

Subjects 

Seven  right-handed  male  volunteers  were 
paid  for  their  participation  in  the  study.  All 
of  the  subjects  were  student  pilots  enrolled  in 
an  aviation  course  at  the  University  of  Illi¬ 
nois'  Willard  Airport.  Prerequisites  for  the 
course  include  two  semesters  of  basic  visual 
flight  rule  (VFR)  training,  the  possession  of  a 
private  pilot’s  license,  and  two  introductory 
courses  in  IFR  flight  skills.  Thus,  the  student 
pilots  were  proficient  in  VFR  flight  skills  and 
had  a  basic  familiarity  with  both  IFR  flight 
skills  and  precision  landing  approach  tech¬ 
niques.  All  subjects  were  between  the  ages  of 
20  and  26,  and  had  normal  hearing  and 
normal  or  corrected-lo-normal  vision. 
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Simulator  and  Stimulus 
Generation  Equipment 

The  ILLIMAC  flight  system  used  in  the  ex¬ 
periment  consisted  of  a  fixed-based  flight 
simulator  that  was  designed  around  the 
INTEL  8086  digital  microprocessor.  The  sim¬ 
ulator  was  developed  at  the  Institute  of  Avia¬ 
tion  at  the  University  of  Illinois.  The  flight 
equations  were  produced  by  the  16-bit  8086 
microcomputer.  The  digital  information  w'as 
converted  to  analog  voltages  to  drive  the  sim¬ 
ulator  instrumentation.  In  the  present  study, 
the  simulator  was  configured  to  mimic  the 
flight  characteristics  of  the  Beechcraft  Sport 
180,  a  single-engine  aircraft  with  fixed  gear 
and  a  fixed  propeller.  The  ILLIMAC  flight 
panel  contained  the  instrumentation  and  na¬ 
vigational  radios  required  for  instrument 
flight  conditions.  Flight  performance  mea¬ 
sures  were  digitized  at  the  rate  of  30  Hz  and 
were  transferred  via  an  RS232  link  to  a  DEC 
PDP  11/73  human-performanee/eleetrophysi- 
oiogical  laboratory  computer  (Heffley,  Foote, 
Mui,  and  Donehin,  1985). 

The  auditory  stimuli  employed  for  the  sec¬ 
ondary  task  were  produced  by  an  audio-gen¬ 
erator  and  binaurally  presented  to  the  pilots 
through  headphones.  ERPs,  flight  perfor¬ 
mance  data,  and  secondary  task  RTs  were  re¬ 
corded  on  magnetic  tape  lor  off-line  analysis. 

Tasks 

A  graphic  illustration  of  the  basic  flight 
task  is  presented  in  Figure  1 .  The  flight  began 
on  Runway  32  at  Willard  Airport.  The  pilot 
was  instructed  to  climb  to  3000  feet  at  a  con¬ 
stant  rate  of  500  feet  per  minute  and  to  inter¬ 
cept  the  062  radial  from  the  Champaign 
very-high-frequency  omnidirectional  radar 
beacon  (VOR)  after  turning  to  a  heading  of 
090  deg.  At  the  12  mile  distance  measuring 
equipment  point  (DME),  the  pilot  was  in¬ 
structed  to  roll  30  deg  to  the  left  and  fly  one 
minute  outbound  on  a  032  heading  in  order 


Figure  I .  A  schematic  illustration  of  the  45-min 
flight  mission  performed  by  the  student  pilots. 


to  set  up  a  holding  pattern  with  an  inbound 
heading  of  242  deg.  Three  standard  holding 
patterns  were  flown  prior  to  continuation  of 
the  flight.  Next,  the  pilots  were  instructed  to 
track  the  216  radial  direct  to  the  Veals  (the 
outer  marker  for  the  ILS  approach  to 
Runway  32  at  Champaign)  non-directional 
beacon  (NDB),  maintaining  an  airspeed  of 
135  knots.  At  the  6-mile  DME  point  (Veals), 
the  pilots  were  to  turn  outbound  in  order  to 
intercept  the  localizer  inbound  to  Runway 
32.  At  the  8.2-miie  DME  point,  a  turn  was  to 
be  made  to  a  091  heading.  After  flying  for  two 
minutes  outbound,  the  pilot  executed  a  pro¬ 
cedure  turn  to  intercept  the  localizer  in¬ 
bound  to  Runway  32.  When  the  localizer  was 
intercepted,  the  pilot  tracked  316  deg  at  2600 
feet  towards  the  middle  marker,  where  the 
flight  was  terminated.  Performance  mea¬ 
sures  included  deviations  from  assigned 
heading,  airspeed,  altitude,  and  glideslope. 

This  IFR  flight  plan  — roundtrip  from 
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Champaign  to  Octoe  intersection  —  formed 
the  framework  within  which  the  processing 
demands  imposed  upon  the  student  pilots 
were  investigated.  (See  Champaign  approach 
plate  AL-709,  ILS  Runway  32  for  additional 
details.)  Thus,  the  flight  segments  consti¬ 
tuted  the  primary  task  in  this  experiment, 
and  subjects  were  instructed  to  maximize 
their  performance  of  the  flight  missions. 

In  addition  to  the  primary  task,  subjects 
performed  a  concurrent  secondary  task.  This 
task  required  the  student  pilots  to  monitor  a 
Bernoulli  sequence  of  auditory  stimuli  pre¬ 
sented  binaurally  through  headphones.  Two 
different  tone  frequencies  (1000  Hz  and  1500 
Hz)  were  used.  The  1000-Hz  tone  was  desig¬ 
nated  as  the  target  tone  for  four  subjects;  for 
the  remaining  three  subjects,  the  1500-Hz 
tone  was  designated  as  the  target.  Targets 
were  presented  on  30%  of  the  trials.  Subjects 
were  instructed  to  respond  to  targets  by  de¬ 
pressing  a  switch  located  on  the  left  side  of 
the  control  yoke.  The  response  required  only 
a  minimal  movement  of  the  thumb.  Both 
speed  and  accuracy  were  emphasized  in  the 
instructions.  Non-targets,  which  were  pre¬ 
sented  70%  of  the  time,  did  not  require  a  re¬ 
sponse.  Tones  were  50  ms  (including  a  10-ms 
rise  fall  time,  65  dB)  in  duration,  and  were 
presented  every  1.4  to  1.7  s. 

[  RP  Recurdin g  System 

Electroencephalographic  (EEG)  activity 
was  recorded  Irom  three  midline  sites  (Fz,  Cz, 
and  Pz  according  to  the  International  10/20 
system;  see  Jasper,  1958)  and  referred  to 
linked  mastoids.  Beckman  Biopotential  Ag- 
AgCl  electrodes  filled  with  Grass  electrode 
paste  were  attached  to  all  scalp  sites  and  also 
to  a  forehead  ground.  In  addition,  identical 
electrodes  were  placed  above  and  below  the 
subject’s  right  eye  to  evaluate  electrooculo- 
graphic  (EOG)  activity  in  the  vertical  plane. 
All  electrode  impedances  were  maintained 
below  10  kohms. 


The  EEG  and  EOG  channels  were  ampli¬ 
fied  by  Grass  Model  12A5  amplifiers  with  a 
10-s  time  constant  and  an  upper  half  ampli¬ 
tude  of  35  Hz,  3-dB/octave  rolloff.  The  re¬ 
cording  epoch  for  all  channels  was  1300  ms, 
beginning  100  ms  prior  to  the  presentation  of 
secondary  task  tones.  The  data  channels  were 
digitized  every  5  ms  and  were  digitally  fil¬ 
tered  off-line  ( -  3  dB  at  6.27  Hz;  0  dB  at  14.29 
Hz)  prior  to  further  analysis.  Artifactual  con¬ 
tributions  to  the  EEG  from  EOG  activity 
were  evaluated  and  eliminated  off-line  by 
submitting  the  data  to  an  eye-movement  cor¬ 
rection  procedure  (Gratton,  Coles,  and  Don- 
chin,  1983). 

Procedure 

Each  subject  flew  a  total  of  four  45-minute 
missions  in  the  fixed-base  ILLIMAC  flight 
simulator.  In  the  first  session,  the  students 
flew  the  flight  course  twice.  These  flights 
served  to  familiarize  the  subjects  with  the 
IFR  flight  plan  and  the  dynamics  of  the  simu¬ 
lator.  Both  of  the  missions  were  flown  under 
the  easy  flight  conditions  (no  w'ind,  turbu¬ 
lence,  or  subsystem  failures).  Since  the 
flights  were  considered  as  practice,  the  per¬ 
formance  data  will  not  be  presented  here. 

In  the  second  session,  subjects  again  flew 
the  flight  path  illustrated  in  Figure  1.  How¬ 
ever,  in  this  session  one  of  the  two  flights  in¬ 
cluded  30-mile/h  winds  from  270  deg,  mod¬ 
erate  turbulence,  and  a  partial  suction  failure 
in  the  heading  indicator  during  approach  to 
landing.  The  .presentation  order  of  the  easy 
and  difficult  flights  in  the  second  session  was 
counterbalanced  across  subjects. 

Prior  to  the  flights,  subjects  were  presented 
with  a  short  block  (30  trials)  of  tones  in  order 
to  familiarize  them  with  the  tone  frequencies 
among  which  they  were  subsequently  asked 
to  discriminate.  Subjects  then  performed  a 
short,  single-task  flight  segment  in  which 
they  flew  a  straight  and  level  course  for  five 
minutes  under  calm  atmospheric  conditions 
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(no  wind  or  turbulence).  This  task  was  in¬ 
cluded  to  provide  a  baseline  for  subsequent 
subjective  ratings  of  task  difficulty.  Subjects 
were  instructed  to  assign  a  value  of  100  to 
this  flight  segment  and  to  compare  each  of 
the  flight  segments  in  the  easy  and  difficult 
flights  with  this  baseline  when  assigning 
their  subjective  difficulty  ratings  (Gopher 
and  Braune,  1984).  Subjects  then  performed 
the  auditory  discrimination  task  alone  for 
200  trials.  This  condition  was  included  to 
provide  single-task  estimates  of  performance 
and  ERPs  on  the  secondary  task. 

Following  the  discrimination  task,  subjects 
performed  the  flight  tasks  concurrently  with 
the  lone-discrimination  task.  A  certified 
flight  instructor  (CFI)  was  present  during 
each  flight  to  instruct  the  subjects  on  the 
flight  scenarios  and  to  evaluate  their  perfor¬ 
mance.  Upon  completion  of  each  of  the  two 
flights,  subjects  were  asked  to  rate  the  diffi¬ 
culty  of  the  flight  as  a  whole,  as  well  as  each 
of  the  individual  flight  segments.  Each  of  the 
(light  missions  lasted  approximately  45  min. 
Subjects  received  a  15-min  rest  break  be¬ 
tween  flights. 

Finally,  subjects  again  performed  200  audi¬ 
tory  discriminations  under  single  task  condi¬ 
tions.  Because  the  ERPs  and  performance 
data  from  the  two  single  task  discrimination 
conditions  were  not  significantly  different, 
they  were  averaged  together  prior  to  further 
analvsis. 

RESULTS  AND  DISCUSSION 

Flight  Performance  and  Probe 
Discrimination  Data 

The  llighl  performance  data  were  collected 
to  assess  the  validity  of  the  difficulty  manip¬ 
ulations,  both  within  and  across  missions. 
The  flight  scenario  has  been  described  in 
terms  of  the  mission  requirements  (see 
Figure  I ).  For  purposes  of  the  statistical  anal¬ 
vsis.  the  mission  was  partitioned  into  four 


flight  segments,  on  the  basis  of  common  op¬ 
erational  demands  (as  judged  by  a  panel  of 
CFIs).  Segment  A  comprised  both  the  takeoff 
and  the  preparation  for  final  approach  to 
landing.  The  two  straight  and  level  flight 
segments  were  combined  into  Flight  Seg¬ 
ment  B,  and  the  three  holding  patterns  into 
Segment  C.  The  final  approach  to  landing 
and  flight  along  the  glideslope  were  com¬ 
bined  into  Segment  D. 

Two  measures  of  flight  performance, 
heading  and  altitude  deviation,  were  re¬ 
corded  in  all  flight  segments  in  both  easy  and 
difficult  missions.  These  indices  were  sub¬ 
mitted  to  a  two-way  repeated  measures  anal¬ 
ysis  of  variance.  One  additional  measure,  de¬ 
viation  from  the  glideslope,  was  recorded  in 
both  the  easy  and  difficult  scenarios  in  the 
final  flight  segment.  Table  1  presents  the 
mean  values  of  the  flight  performance  mea¬ 
sures  for  both  easy  and  difficult  flights. 

All  three  of  the  flight  performance  mea¬ 
sures  indicated  that  our  between-mission  ex¬ 
perimental  manipulations  successfully  in¬ 
fluenced  the  difficulty  of  the  flight  task. 
When  the  student  pilots  were  required  to  fly 
the  45-min  mission  with  high  w'inds,  mod¬ 
erate  turbulence,  and  a  subsystem  failure 
during  approach  to  landing,  their  deviations 
from  command  altitude  increased,  F(l,6)  = 
6.6,  p  <  0.05;  their  ability  to  track  the  glide- 
slope  accurately  decreased,  F(  1,6)  =  8.0,  p  < 
0.05;  and  their  deviations  from  assigned 
headings  increased,  F(l,6)  =  9.1,  p  <  0.05; 
relative  to  the  mission  flown  under  easy 
flight  conditions.  Performance  was  also  in¬ 
fluenced  bv  mission  segment,  irrespective  of 
between-mission  difficulty  (see  Table  2). 
Subjects  were  more  accurate  at  maintaining 
their  assigned  headings  in  Segments  B  and  C 
than  they  were  in  Segments  A  and  D,  indi¬ 
cating  that  takeoff  and  landing  were  more 
difficult  for  the  students  than  was  straight 
and  level  flight  or  holding  patterns,  F(3,18)  = 
4.3,  p  <  0.05.  However,  students’  perfor- 
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mance  on  the  altitude  measure  did  not  differ 
as  a  function  of  (light  segment.  Furthermore, 
no  significant  interactions  were  obtained  for 
any  of  the  flight  measures.  Thus,  flight  per¬ 
formance  measures  most  strongly  discrimi¬ 
nated  between  easy  and  diilicult  flights, 
whereas  intersegment  differences  were  found 
for  only  a  subset  of  the  performance  mea¬ 
sures. 

Performance  measures  for  the  ERP-elici- 
ting  probe  task  are  presented  in  Table  1  for 
the  between-mission  comparison  and  in 
Table  2  for  the  comparison  across  flight  seg¬ 
ments.  Subjects  were  uniformly  accurate 
across  missions  and  flight  segments,  with  re¬ 
sponse  accuracy  ranging  from  89%  to  93%. 
Neither  RTs  nor  accuracies  differed  as  a 
function  of  flight  segments  or  missions  (p  > 
0.05).  Thus,  any  differences  among  the  ERPs 
elicited  in  different  flight  conditions  cannot 
be  attributed  to  the  subjects'  failure  to  per- 
lorm  the  probe  task  in  the  more  diilicult 
flight  missions  or  segments. 

Subjective  Workload  Ratings 

Table  I  presents  the  subjective  workload 
ratings  for  the  between-mission  compar¬ 
isons,  and  Table  2  displays  the  ratings  tor 
each  of  the  flight  segments.  The  subjective 
ratings  were  collected  in  a  manner  described 
bv  Gopher  and  Braune  (1984),  Subjects  ini¬ 
tially  flew  a  5-min  straight  and  level  flight 


path  and  were  instructed  to  assign  this  seg¬ 
ment  a  workload  rating  of  100.  Each  of  the 
flight  segments  in  the  easy  and  difficult 
flights  w-as  then  rated  relative  to  the  straight 
and  level  segment.  Subjects  made  their 
ratings  after  each  mission  and  were  per¬ 
mitted  to  assign  any  numerical  value  to  their 
estimates  of  subjective  workload.  The  ratings 
were  normalized  prior  to  statistical  analysis. 

Subjects  rated  the  flight  mission  with  high 
winds,  moderate  turbulence,  and  a  partial 
suction  failure  in  the  heading  indicator 
during  instrument  landing  system  (ILS)  ap¬ 
proach  as  having  a  significantly  higher  work¬ 
load  than  the  flight  without  wind,  turbu¬ 
lence,  or  subsystem  failures,  F(]  ,6)  =  18.7,  p 
<  0.01.  Subjective  ratings  also  discriminated 
among  flight  segments,  /-"( 1,6)  =  6.0,  p  < 
0.01.  Moreover,  the  subjects  estimated  Seg¬ 
ments  A,  C,  and  D  to  be  equally  difficult, 
whereas  Segment  B  —  the  straight  and  level 
portion  of  the  flight  —  was  estimated  to  be 
easier  than  the  other  three  segments,  F(l,6) 
=  9.8,  p  <  0.01.  No  significant  interactions 
were  obtained  for  the  workload  measures. 

A  comparison  of  the  flight  performance 
measures  and  the  subjective  workload 
ratings  suggests  that,  for  the  most  part,  the 
pilots’  subjective  estimates  corresponded 
well  with  their  performance  on  the  flight 
task.  Both  the  performance  measures  and  the 
subjective  ratings  discriminated  between 


TABLE  1 

Mean  land  Standard  Dewationl  Simulator  Performance,  Probe  Discrimination  Reaction  Time  and  Accu¬ 
rate.  and  Subjective  Workload  Ratings  (or  Ease  and  Difficult  Flights 


Flight  Missions 


Measures 


Easy  Flight 


Diilicult  Flight 


Heading  deviation  (deg) 

Altitude  deviation  (feet) 

Subjective  workload  ratings 
Probe  reaction  time  (ms) 

Probe  accuracy  (percentage  correct) 
Glideslope  deviation  (deg) 


1  86  (0  67) 
40  3  (17  8) 
115  2  (12  7) 
580  0  (139  5) 
92  4  (6  0) 

0  35  (0  21) 


3.08  (0  90) 
70  9  (37.8) 
137  8  (21.6) 
604  0  (119.3) 
89  0  (7.2) 

0  74  (0  30) 
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TABLE  2 


Mean  (and  Standard  Deviation)  Simulator  Performance,  Probe  Discrimination  Reaction  Time  and  Accu¬ 
racy  and  Subjective  Workload  Ratines  tor  the  Four  Flight  Segments 

Flight  Segments 


Measures 

Segment  A 

Segment  B 

Segment  C 

Segment  D 

Heading  deviation 
(deg) 

2.89  (1.03) 

1  87  (1.06) 

1.92  (0.81) 

3.19  (1.44) 

Altitude  deviation 
(feet) 

53.6  (32  2) 

60.1  (12.5) 

41.9  (11.2) 

67  4  (58.6) 

Subjective  workload 
ratings 

128  6  ( 18.5) 

117.2  (10.1) 

128.4  (13.1) 

133.0  (21.7) 

Probe  reaction  time 
(ms) 

604  0  (101) 

564.0  (135) 

584.0  (127) 

617.0  (146) 

Probe  accuracy 
(percentage  correct) 

89  4  (7  3) 

93  1  (6  0) 

92  2  (6  4) 

89.6  (6.4) 

easy  and  difficult  missions  Flight  segments 
were  also  differentiated  bv  subjective  and 
objective  measures.  However,  m  this  case  the 
heading  deviation  measure  indicated  that  the 
students’  performance  in  both  straight  and 
level  Might  and  holding  pattei  ns  was  superiot 
to  their  performance  during  takeoff  and 
landing,  whereas  sub|cciive  workload  ratings 
suggested  that  holding  patterns  were  per¬ 
ceived  to  be  as  difficult  as  takeoffs  and  prepa¬ 
ration  lor  landing 

Event- Related  Potential', 

Figure  2  presents  the  grand  aveiage  ERIN 
elicited  bv  the  target  tones  in  l fie  single-  and 
dual-task  conditions.  In  Figure  3.  the  ERIN 
recorded  at  P/  are  overplotlcd  for  t he  three 
conditions.  Three  different  components  can 
be  discerned  bv  visual  inspection  of  the 
waveforms.  The  earliest  is  a  frontally  max¬ 
imal.  negative-going  deflection  that  occurs 
between  100  and  200  rns  post-stimulus  ERP 
components  are  traditionally  defined  in 
terms  ol  their  latency  relative  to  a  stimulus 
or  response,  scalp  distribution,  and  sensi¬ 
tivity  to  experimental  manipulations  (Don- 
chin,  Ritter,  and  McCailum,  1978;  Sutton 
and  Ruchkin,  1984)  Bv  virtue  ol  its  latency 
and  scalp  distribution,  this  component  can 


be  defined  as  the  N100.  A  second  frontally 
maximal  negative  component  is  also  visible 
in  the  waveform  This  component  reaches  its 
peak  negativity  in  the  200-  to  300-ms  latency 
range,  and  will  henceforth  be  relerred  to  as 
the  N200  A  large  positive-going  deflection 
peaking  between  300  and  500  ms  posl-slim- 
ulus  can  also  be  identified  bv  visual  inspec¬ 
tion  of  the  waveforms  This  component  in- 
c teases  in  amplitude  from  the  frontal  to  the 
parietal  recording  site  and  appears  to  dis¬ 
ci  miniate  among  levels  ol  task  difficulty.  The 
component  will  be  labeled  the  P300. 

The  single  trial  ERPs  were  corrected  for 
eve  movement  artilacts  and  then  averaged 
within  experimental  conditions  The  three 
F. R P  components  described  above  were 
quantified  bv  measures  of  peak  latency  and 
amplitude  For  the  P300  component,  the  la- 
tenev  measure  was  obtained  Irom  the  single 
trials  The  amplitude  measure  ol  the  P300 
and  the  latency  and  amplitude  measures  for 
the  other  ERP  components  were  derived 
from  the  average  waveforms.  Area  measures 
lor  the  N100,  N200,  and  P3u0  components 
were  obtained  from  the  100  to  200,  200  to 
.300,  and  300  to  500  ms  latency  ranges,  re¬ 
spectively.  Peak  latencies  were  defined  as  the 
largest  positive  or  negative  deflection  within 
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Figure  2  ERP s  averaged  across  sub/ eels  for  the  lone 
discrimination  task  and  both  of  the  flight  missions 

the  predefined  latency  ranges.  Two  separate 
analyses  were  performed  on  the  ERP  data.  In 
the  between-mission  analysis,  ERP  compo¬ 
nent  measures  were  submitted  to  three-way. 


Figure  3.  Parietal  grand  average  ERPs  overplotted 
for  the  tone  discrimination  task  alone  and  for  both  of 
the  flight  missions 


repeated-measures  analyses  of  variance  (3 
Single- Dual  Tasks  x  2  Probe  Types  x  3  Elec¬ 
trodes).  Three  factors  were  also  entered  into 
the  within-flighl  analyses  oi  variance  (4 
Flight  Segments  x  2  Probe  Types  x  3  Elec¬ 
trodes). 

P300  Component 

Further  support  tor  the  identification  of  the 
large  positive  deflection  with  the  P300  com¬ 
ponent  was  provided  bv  the  signilicant  main 
effects  tor  electrode,  F(l,&)  =  7.9,  p  <  0  05, 
and  probe  type,  F(  1 ,0)  =  9.8,  p  <  0.05.  P300s 
were  larger  lor  the  low -probability  tone  than 
they  were  for  the  high-prooability  tone.  The 
amplitude  of  the  P300  increased  from  the  F z 
to  the  Cz  to  the  Pz  electrode  site.  Both  of 
these  effects  are  consistent  with  previous 
findings  and  have  been  used  as  definitional 
criteria  lor  the  P300  (Donchin  et  al.,  1978; 
Kramer,  1985).  One  of  the  major  questions  in 
the  present  study  was  the  extent  to  which 
P300  amplitude  would  discriminate  among 
levels  of  workload  imposed  on  the  student 
pilots  by  the  flight  tasks  The  main  effect  of 
flight  mission  indicated  that  P300  amplitude 
was  sensitive  to  the  task  demands  of  the  dif- 
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tereni  missions.  /-T 2.12)  -  4.7,  p  <  0.05.  Post 
hoc  comparisons  iui ther  indicated  that  P300s 
elicited  bv  the  tones  in  the  discrimination 
task  were  largest  in  the  single-task  condi¬ 
tions;  ol  intermediate  amplitude  when  the 
students  were  living  with  no  wind,  turbu¬ 
lence,  or  subs\ stem  (allures;  and  smallest 
with  high  winds,  turbulence,  and  a  heading 
indicator  Intlurc  (lor  all  comparisons,  p  < 
0.05;  see  Table  3).  This  systematic  decrease  in 
the  amplitude  ot  the  P300s  elicited  by  the 
tone  discrimination  task  alone  and  combined 
with  the  (light  missions  mimics  the  resource 
trade-oils  presumed  to  underlie  multitask 
performance. 

Additional  support  lor  the  sensitivity  ol 
P300  to  changes  in  the  resource  demands  ot  a 
task  was  found  m  an  analysis  ot  the  ellects  ot 
stimulus  sequence  on  the  P300s  elicited  in 
single-  and  dual-task  conditions  Previous 
studies  have  shown  that  there  is  a  systematic 
lelatiunship  between  the  amplitude  ot  the 
P300  elicited  bv  a  stimulus  on  a  given  trial 
and  the  sequence  ot  stimuli  that  precede  it 
(Johnson  and  Doiichm.  1978.  Squires.  Pelu 
i  how  ski ,  Wickens.  and  Donclim,  1977; 
Squires,  Wickens.  Squires,  and  Donchin, 
1*761  When  a  particular  trial  is  preceded  bv 
the  alternate  stimulus  tin  the  present  experi¬ 
ment.  a  low  tone  tollowed  bv  a  high  tone,  or 
vice  versa)  a  large  P300  ts  elicited  However. 


when  a  given  trial  represents  a  repetition  of  a 
previous  stimulus,  a  relatively  small  P300  is 
obtained.  A  model  proposed  to  account  for 
this  "sequential"  effect  has  emphasized  the 
importance  of  three  controlling  factors:  (1) 
the  global  probability  of  each  event,  (2)  the 
specific  structure  of  the  prior  sequence,  and 
(3)  the  memory  for  event  frequency  in  the 
prior  sequence  (Squires  et  al.,  1976).  Within 
the  domain  of  complex,  real-world  tasks,  we 
might  expect  that  as  the  resource  demands  of 
a  task  increase,  the  memory  capacity  avail¬ 
able  to  maintain  the  sequential  effect  might 
decrease.  This,  in  turn,  should  decrease  the 
difference  between  the  amplitude  of  the 
P300s  elicited  by  alternations  and  repeti¬ 
tions. 

Figure  4  presents  the  grand  average  ERPs 
elicited  by  stimulus  alternations  and  repeti¬ 
tions  for  single-  and  dual-task  conditions.  As 
can  be  seen  from  the  figure,  there  is  a  large 
difference  between  the  amplitude  of  the 
P300s  m  the  tone  discrimination  task.  Both 
the  overall  amplitude  as  well  as  the  differ¬ 
ence  between  the  alternation  and  repetition 
P300s  appears  to  decrease  when  the  tone  dis- 
enmination  task  and  flight  task  are  per¬ 
formed  concurrently.  Further  decreases  in 
amplitude  are  apparent  when  comparing  the 
easy  and  difficult  conditions. 

The  differences  between  these  conditions 


I  \  H 1  h  t 

Mean  land  Siandaid  Deviation)  NIUO  and  P100  Amplitude  (Arbitrary  Units)  and  Latencies  (ms)  fur  the 
Tone  Dim. i  i ruination  Alone  and  Combined  with  the  Two  Flight  Missions 


Flight  Missions 


Measures 


Single  Task  Easy  Flight  Difficult  Flight 


'MOO  amplitude 
'MOO  latency 
"P300  amplitude 
"P300  latency 


1139(480)  650(268)  650  (235) 

126  (17)  121  (12)  122  (13) 

1954(903)  902(516)  482(536) 

354(26)  441  (63)  468(62) 


‘  Ootamed  from  F/  electrode  site 
“  Octa-ned  from  P/  electrode  s*te 
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Difficult  Mission 


Figure  4.  Parietal  grand  average  ERPs  elicited  by 
stimulus  repetition  and  alternation  for  single-  and 
dual-task  conditions. 


were  quantified  by  obtaining  measures  of  the 
amplitude  ol  the  P300  and  submitting  these 
values  to  a  three-way,  repeated-measures 


analysis  of  variance  (2  Stimulus  Sequences 
x  3  Single/Dual  Tasks  x  2  Probe  Types). 
Main  effects  were  obtained  for  stimulus  se¬ 
quence,  TO, 6)  =  25.9,  p  <  0.01,  and  task, 
F(2,12)  =  4.2,  p  <  0.05.  More  interesting, 
however,  was  the  significant  interaction  be¬ 
tween  task  and  sequence,  F(2,12)  =  4.0,  p  < 
0.05,  which  suggested  that  the  overall  ampli¬ 
tude  as  well  as  the  difference  between  alter¬ 
nations  and  repetitions  decreased  with  in¬ 
creasing  task  difficulty.  Moreover,  post  hoc 
comparisons  indicated  that  although  the  dif¬ 
ference  between  alternations  and  repetitions 
was  significant  for  the  tone-discrimination 
task  and  the  easy  flight,  alternation  and  repe¬ 
tition  P300s  did  not  differ  significantly  in  the 
difficult  flight  condition.  Thus,  since  the 
global  probability  and  the  sequential  struc¬ 
ture  of  the  ERP  eliciting  tones  was  uniform 
across  conditions,  these  results  may  impl. 
that  residual  memory  capacity  decreased 
with  increased  task  difficulty. 

A  main  effect  of  flight  mission  was  also 
found  for  the  P300  latency  variable,  F(3,18) 
=  1 1 .3,  p  <  0.01 .  Post  hoc  comparisons  indi¬ 
cated  that  this  effect  could  be  attributed  to 
the  significant  difference  between  single  and 
dual  tasks,  (p  <  0.05).  However,  latencies  did 
not  differ  between  the  two  flight  missions. 
This  finding  suggests  that  the  difference  in 
P300  amplitude  between  the  two  flight  mis¬ 
sions  cannot  be  explained  by  increased  la¬ 
tency  variability  in  the  difficult  flight.  Given 
that  single-task  P300  latencies  are  often 
shorter  than  dual-task  latencies,  it  appears 
safe  to  conclude  that  the  P300  component 
successfully  discriminated  between  single 
and  dual  tasks  as  well  as  between  the  two 
versions  of  the  flight  task. 

Thus  far  we  have  described  the  effects  of 
between-mission  difficulty  on  the  amplitud 
and  latency  of  the  P300  component.  Our 
second  analysis  compares  ERP  components 
elicited  during  the  four  flight  segments.  It 
was  predicted  that  the  P300s  elicited  by  the 
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tunes  in  the  more  difficult  flight  segments 
would  be  smaller  than  those  recorded  during 
the  easier  Might  segments,  reflecting  in¬ 
creased  processing  demands  in  the  more  dif¬ 
ficult  conditions.  Although  the  within-mis- 
sion  comparisons  yielded  weaker  ellects  lor 
the  performance  and  subjective  measures 
than  did  the  between-mission  analysis,  an  or¬ 
dering  of  the  Might  segments  could  be  ascer¬ 
tained.  The  straight  and  level  Might  segment 
(B)  and  the  holding  pattern  (C)  were  Mown 
with  smaller  heading  deviations  than  the 
takeoff  (A)  and  landing  (D)  components  ot  the 
missions.  The  student  pilots  also  rated  the 
straight  and  level  segment  to  be  subjectively 
easier  than  the  other  three  segments.  Al¬ 
though  the  ordering  ot  the  P300  amplitudes 
was  consistent  with  these  measures  (the 
mean  amplitudes  ol  Segments  A  through  D 
were  209,  525,  508,  and  283,  respectively),  the 
main  eMect  lor  Might  segment  did  not  attain 
statistical  significance  (p  >  0.05).  However,  a 
small  but  significant  correlation  was  ob¬ 
tained  between  the  amplitude  ol  the  P300 
and  the  deviaiion-ti om-command  heading  ( R 
-  -0.27),  indicating  that  the  amplitude  of 
P300  decreased  with  increases  in  heading  de¬ 
lation.  The  latency  ol  P300  did  not  ditler 
across  Might  segments. 

.Vcgu/hv  Component * 

Two  dillerent  negative  components  are  ap¬ 
parent  m  the  waveforms  presented  in  Figure 
I.  The  earliest  is  a  Irontallv  maximal  deflec¬ 
tion,  which  occurs  between  100  and  200  ms 
post-stimulus.  This  component,  labeled  the 
NI00,  has  been  found  to  be  sensitive  to  the 
allocation  ol  attention  to  physical  attributes 
ol  stimuli  such  as  frequency,  loudness,  and 
location.  NIOOs  are  larger  lor  any  stimulus 
possessing  the  attended  attribute  than  they 
are  for  stimuli  possessing  other  attributes 
(Hillvard  and  Hansen,  1986).  The  amplitude 
ol  the  N100  is  also  influenced  bv  the  num¬ 
ber  of  information  sources  that  must  be  at¬ 


tended  simultaneously  (Parasuraman,  1976; 
Schwent  and  Hillyard,  1975).  NIOOs  decrease 
in  amplitude  as  subjects  are  required  to 
monitor  additional  sources  ol  information. 

In  the  present  experiment,  an  important 
question  concerns  the  degree  to  which  the 
N100  reflects  the  transition  Irom  single  to 
dual  tasks,  as  well  as  the  increase  in  work¬ 
load  imposed  on  the  student  pilots  by  the 
more  difficult  flight  task.  To  this  end,  mea¬ 
sures  of  N100  amplitude  and  latency  were 
submitted  to  a  three-way,  repeated-measures 
analysis  of  variance  (3  Single/Dual  Tasks  x  2 
Probe  Types  x  3  Electrodes).  A  significant 
main  effect  was  obtained  for  the  task  vari¬ 
able,  F(2,12)  =  5.0,  p  <  0.05.  Post  hoc  com¬ 
parisons  indicated  that  this  effect  could  be 
attributed  to  larger  NIOOs  in  the  single-task 
condition  {p  <  0.05).  The  amplitude  of  the 
NIOOs  did  not  differ  between  the  easy  and 
difficult  flight  missions.  No  other  main  ef¬ 
fects  or  interactions  were  significant  for 
N  100  amplitude  or  latency  measures.  The  la¬ 
tency  and  amplitude  measures  for  the  later 
frontally  negative  component,  the  N200, 
were  also  submitted  to  a  three-way,  re¬ 
peated-measures  analysis  of  variance.  None 
of  the  main  effects  or  interactions  were  sig¬ 
nificant  for  this  component. 

CONCLUSIONS 

The  results  of  the  present  experiment  pro¬ 
vide  preliminary  support  lor  the  assertion 
that  components  of  the  ERP  can  provide  sen¬ 
sitive  and  reliable  measures  of  the  task  de¬ 
mands  imposed  upon  operators  of  complex, 
real-world  systems.  One  component  of  the 
ERP  in  particular,  the  P300,  varied  in  a  sys¬ 
tematic  manner  in  response  to  the  demands 
of  different  versions  of  the  flight  task.  Rela¬ 
tive  to  conditions  in  which  the  tone  discrimi¬ 
nation  task  was  performed  alone,  the  ampli¬ 
tude  of  the  P300  decreased  when  the  student 
pilots  performed  the  easy  version  of  the  flight 
task.  Further  decreases  in  P300  amplitude 
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were  observed  when  the  dillieultv  ol  the 
flight  task  was  increased  through  the  manip¬ 
ulation  of  wind  speed,  turbulence,  and  the 
probability  of  subsystem  failures.  Although 
the  within-mission  effects  were  not  as  dra¬ 
matic  as  the  between-mission  comparisons,  a 
small  but  significant  correlation  between 
P30O  amplitude  and  heading  deviation  was 
obtained  across  flight  segments.  Another 
component  of  the  ERP,  the  N100,  varied  in 
amplitude  as  a  function  of  the  number  ol 
tasks  that  the  subjects  performed. 

The  sensitivity  of  the  P300  to  the  pro¬ 
cessing  demands  ol  the  different  flight  mis¬ 
sions  is  noteworthy  lor  several  reasons.  First, 
the  changes  in  the  amplitude  ol  the  P300  as  a 
function  of  task  demands  mimics  the  modu¬ 
lation  of  resources  presumed  to  underlie  vari¬ 
ations  in  operator  performance.  Resource 
models  predict  that  as  task  demands  in¬ 
crease,  additional  resources  will  be  allocated 
to  the  high-priority  task,  thereby  with¬ 
drawing  resources  (torn  tasks  ol  lesser  im¬ 
portance  (Navon  and  Gopher,  1979;  Wiekens, 
1980).  The  amplitude  of  the  P300s  elicited  by 
the  secondary  probe  task  decreased  with  in¬ 
creases  in  the  dillieultv  of  the  1  light  task. 
Other  studies  have  lound  that  P300s  elicited 
b\  primary  task  events  increase  in  amplitude 
with  increases  in  task  demands  (Kramer  et 
al . ,  1085).  Thus,  it  appears  that  the  P300  pro¬ 
vides  a  measure  of  the  hypothetical  resources 
that  can  only  be  inferred  from  more  tradi¬ 
tional  measurement  techniques. 

A  second  point  concerns  the  use  of  the  sec¬ 
ondary  task  procedure  in  the  assessment  of 
mental  workload.  The  resource  demands  of  a 
primary  task  are  usually  inferred  from  decre¬ 
ments  in  secondary  task  performance  (Ogden 
et  al.,  1979).  However,  a  particular  difficulty 
ol  the  secondary  task  methodology  is  the  in¬ 
trusion  of  the  secondary  task  into  primary 
task  performance,  thereby  complicating  the 
interpretation  ol  the  performance  decre¬ 
ments.  In  the  present  study,  our  secondary 


probe  task  required  a  relatively  simple  dis¬ 
crimination  and  an  occasional  overt  re¬ 
sponse.  In  fact,  secondary  task  performance 
did  not  discriminate  among  levels  of  primary 
task  difficulty.  Secondary  task  reaction  time 
and  accuracy  were  uniformlv  high  in  all  con¬ 
ditions.  On  the  other  hand,  the  P300  elicited 
by  the  probe  stimuli  did  discriminate  among 
the  demands  of  the  flight  task.  Thus,  the 
ERP-eliciting  probe  task  provides  a  sensitive 
metric  of  resource  demands  without  in¬ 
truding  upon  the  performance  of  the  task  of 
interest,  a  clear  advantage  in  operational  set¬ 
tings.  However,  this  is  not  to  imply  that  even 
a  relatively  nonintrusive  secondary  task  is  an 
ideal  workload  assessment  procedure  in 
complex,  real-world  systems.  Clearly,  a  more 
acceptable  solution  would  be  the  elicitation 
of  ERP  components  by  primary  task  events, 
thereby  negating  the  requirement  for  any 
type  of  secondary  task.  Such  a  procedure  has 
been  successfully  employed  in  the  labora¬ 
tory,  and  we  are  currently  exploring  its  effi¬ 
cacy  in  operational  settings  (Kramer, 
Wiekens,  Vanasse,  Heffley,  and  Donchm, 
1981;  Sirevaag,  Kramer,  Coles,  and  Donchin, 
1984). 

A  third  point  concerns  the  nature  of  the 
metric.  Although  ERP  components  certainly 
qualify  as  physiological  measures,  they  are 
somewhat  unique  in  that  they  are  selectively 
sensitive  to  a  subset  of  processing  demands. 
Autonomically  mediated  measures  such  as 
heart-rate, variability,  respiration,  blood 
pressure,  and  skin  conductance  are  in¬ 
fluenced  by  ambient  environmental  condi¬ 
tions,  anxiety,  and  physical  exertion,  in  addi¬ 
tion  to  mental  workload  (Wierwille,  1979). 
Therefore,  these  measures  are  sensitive  to 
workload  in  general  but  are  not  diagnostic  in 
the  sense  of  identifying  the  source  of  the  pro¬ 
cessing  demands.  The  P300  is  sensitive  to  in- 
formation-processing  demands,  and,  more 
specifically,  is  influenced  by  perceptual/cog¬ 
nitive  demands  but  not  by  motor  processes. 
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Thus,  unlike  other  physiological  measures. 
ERP  components  are  quite  diagnostic. 

The  unique  characteristics  ot  endogenous 
ERP  components  make  them  well  suited  to 
serve  as  indices  ol  mental  workload.  How¬ 
ever,  since  workload  appears  to  be  best  repre¬ 
sented  as  a  multidimensional  rather  than  a 
scalar  quantity,  ERPs  can  best  tultill  their 
role  by  augmenting  other  workload  metrics. 
Only  through  the  joint  use  ol  a  number  ot 
measurement  techniques  can  we  expect  to 
elucidate  the  mental  wor  kload  ol  human  op¬ 
erators. 
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Abstract 

The  effects  of  foveal  task  difficulty  on  the  processing  of  events  in 
the  visual  periphery  were  investigated  through  an  analysis  of  evert— related 
brain  potentials  and  performance  measures.  Subjects  performed  a  foveally 
presented  continuous  monitoring  task  both  separately  and  together  with  an 
arrow  discrimination  task  which  was  presented  at  three  different  retinal 
eccentricities.  The  subjects  detected  occasional  failures  in  the  monitoring 
task  while  also  responding  to  designated  targets  in  the  left  and  right 
visual  fields.  The  analysis  of  the  event-related  brain  potentials  elicited 
by  discrete  events  in  the  arrow  discrimination  task  indicated  that  the 

amplitude  of  the  N190  and  P300  components  decreased  with  both  the 

Introduction  of  the  foveal  task  and  an  increase  in  its  difficulty.  The  N160 
component  was  sensitive  to  the  distribution  of  attention  within  a  task  but 
was  uninfluenced  by  dual-task  demands.  These  findings  suggest  that  the  N160 
reflects  the  distribution  of  attention  to  different  spatial  locations  within 

a  task  while  the  N190  may  index  the  distribution  of  general  purpose 

perceptual  resources.  P300  appears  to  index  the  allocation  of 
perceptual/central  processing  resources.  The  implications  of  the  results  for 
models  of  resource  allocation  and  attentlonal  gradients  are  discussed. 
DESCRIPTORS:  Selective  and  divided  attention,  dual-tasks,  resource 
allocation,  Event-related  brain  potentials  (ERP),  visual-spatial  attention, 
N160,  N190,  P300. 
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Introduction 

Research  conducted  to  examine  the  sensitivity  of  event-related  brain 
potentials  (ERPs)  to  attentional  processes  has  been  mainly  performed  within 
two  different  paradigms  (Donchin,  Kramer  and  Wickens,  1986;  Hillyard  and 
Kutas,  1983;  Kutas  and  Hillyard,  1984).  Moreover,  these  paradigms  are 
generally  associated  with  theoretical  frameworks  that  focus  on  different 
aspects  of  attention.  In  the  present  study,  we  have  designed  a  hybrid 
paradigm  that  allowed  us  to  investigate  the  interaction  between  two 
varieties  of  attentional  processes;  dual-task  processing  demands  and 
intratask  selective  attention.  Prior  to  describing  our  task  and  hypotheses 
in  more  detail  we  will  outline  the  paradigms  and  theoretical  perspectives 
from  which  our  study  is  derived. 

One  of  the  most  frequently  used  paradigms  is  a  modification  of  the 
classic  dichotic  listening/selective  looking  task  (Becklen  and  Cervone, 

1983;  Cherry,  1953;  Broadbent,  1958;  Moray,  1959;  Neisser  and  Becklen, 

1975).  In  the  ERP  based  version  of  this  task  subjects  are  instructed,  in 
different  conditions,  to  (a)  attend  to  information  in  one  of  two  or  more 
channels  and  ignore  information  presented  on  the  other  channels,  or  (b)  to 
divide  their  attention  among  the  channels  (Eason,  Harter  and  White,  1969; 
Eason  and  Ritchie,  1976;  Hillyard,  Hlnk,  Schwent  and  Picton,  1973).  Channels 
are  generally  defined  in  terms  of  levels  along  a  physical  dimension  of  a 
stimulus  (eg.  ear,  pitch,  loudness,  location,  color,  orientation).  Within 
each  of  the  channels  events  can  be  further  categorized  on  the  basis  of 
physical  or  semantic  differences.  The  subjects  are  instructed  to  respond 
each  time  one  of  two  events  occurs  on  the  attended  channel.  The  event  that 
requires  a  response  is  labelled  the  target  and  usually  occurs  with  a  lower 
probability  than  the  standards. 

Although  the  specific  ERP  components  that  are  recorded  in  the  dichotic 
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listening/selective  looking  paradigm  differ  somewhat  as  a  function  of 
modality  (Hillyard,  Simpson,  Woods,  Van  Voorhis  and  Munte,  1984),  a  number 
of  components  have  been  extensively  examined  across  modalities.  These 
components  include  an  early  negativity  that  peaks  at  approximately  80-120 
msec  in  the  auditory  modality  and  from  140  to  170  msec  in  the  visual 
modality.  Attention  effects  include  both  amplitude  modulations  of  these 
peaks  as  well  as  broader,  sometimes  overlapping  negativities  which  are  known 
as  Nd  in  the  auditory  domain  and  selection  negativity  in  visual  attention.  A 
later  positivity  occurring  at  approximately  300  to  700  msec  post-stimulus 
and  referred  to  as  the  P300  has  been  found  to  be  sensitive  to  attentional 
manipulations  in  the  auditory  and  visual  domains  (Harter  and  Aine,  1984; 
Naatanen,  1982;  Picton,  Campbell,  Baribeau-Braun  and  Proulx,  1978).  Several 
of  the  experimental  results  obtained  in  the  examination  of  the  sensitivity 
of  the  negativities  and  the  P300  component  to  the  manipulation  of 
attentional  processes  are  of  particular  relevance  in  the  present  study. 

First,  it  is  clear  from  experimental  results  that  a  hierarchy  of 
selective  mechanisms  exist.  The  amplitude  of  Che  N100  component 
discriminates  between  the  processing  of  information  in  attended  and 
unattended  channels,  while  later  negativities  and  the  P300  discriminate 
among  target  and  non-target  items  within  the  attended  channel  (Hillyard, 
Munte  and  Neville,  1985;  Naatanen,  1982).  Since  channels  have  been  defined 
in  terms  of  levels  along  physical  dimensions,  this  pattern  of  results 
provides  strong  support  for  early  selection  models  of  attention  which  posit 
that  information  in  the  Ignored  channels  is  attenuated  by  a  filter  that  is 
tuned  on  the  basis  of  physical  stimulus  features  (Broadbent,  1982;  Johnston 
and  Dark,  1982;  Treisman,  1969;  Treisman  and  Riley,  1969).  This  filtering 
strategy  has  been  termed  stimulus  set  selection  (Broadbent,  1958). 

The  subsequent  negativities  and  P300  component  also  suggest  that 
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further  filtering  of  irrelevant  information  takes  place  later  in  the 
information  processing  system.  This  filtering  may  reflect  response  set 
mechanisms  posited  by  late  selection  models  of  attention  (Keele,  1973; 

Deutsch  and  Deutsch,  1963;  Duncan,  1980;  Shiffrin,  McKay  and  Shaffer,  1976). 
Thus,  the  ERP  results  are  quite  consistent  with  both  empirical  findings  as 
well  as  recent  models  that  suggest  that  attentional  selection  is  relatively 
flexible  and  can  take  place  on  the  basis  of  either  physical  features  or 
semantic  content  depending  on  task  constraints  and  subject  strategies 
(Bashinski  and  Bacharach,  1980;  Beck  and  Ambler,  1973;  Dark,  Johnston, 

Myl es-Worsl ey  and  Farah,  1985;  Francolini  and  Egeth,  1980;  Kahneman  and 
Henik,  1981;  Kahneman  and  Treisman,  1984;  Paahler,  1984). 

Another  important  finding  obtained  in  the  ERP  baaed  dichotic 
listening/ selective  looking  paradigm  was  the  demonat rat  ion  of  attentional 
gradients  for  a  variety  of  physical  dimensions.  In  the  visual  modality, 

N 100s  elicited  by  unattended  events  have  been  found  to  systematically 
decrease  in  amplitude  relative  to  attended  events  with  increasing  physical 
distance  (Mangun  and  Hillyard,  1987)  as  well  aa  increased  spatial  frequency 
separation  (Harter  and  Previc,  1978).  Similar  differences  have  been 
obtained  with  respect  to  changes  in  frequency  in  auditory  paradigms  (Alho, 
Sams,  Paavllalnen  and  Naatanen,  1986).  These  ERP  results  are  consistent  with 
several  recent  psychophysical  studies  that  have  obtained  evidence  for 
attentional  gradients  in  both  two  (Erikaen  and  Yeh,  1985;  LaBerge  and  Brown, 
1986;  Shulman,  Wilson  and  Sheehy,  1985)  and  three  dimensional  space  (Downing 
and  Pinker,  1985). 

A  Zoom  Lens  model  has  been  proposed  to  account  for  attentional 
gradients  in  visual  space  (Erlksen  and  Saint  James,  1986).  This  model 
suggests  that  attention  can  operate  along  a  spatial  continuum  ranging  from 
tightly  focused  to  widely  distributed.  When  attention  is  focused,  items  in 
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the  visual  field  are  processed  sequentially  but  with  a  high  degree  of 
precision.  When  attention  is  distributed,  items  are  processed  in  parallel 
but  with  a  lower  degree  of  precision.  Attentional  gradients  are  presumed  to 
result  from  a  gradual  dropoff  in  processing  resources  towards  the  boundary 
of  the  attenclonal  focus.  The  ERP  data  suggest  that  the  mechanism 
underlying  the  phenomenon  of  attentional  gradients  operates  at  a  relatively 
earlv  stage  of  processing. 

The  Zoom  Lens  model  employs  a  resource  metaphor  to  account  for  the 
ditferences  in  speed  and  precision  of  processing  during  focused  and  divided 
attention.  A  fixed  quantity  of  resources  is  proposed  to  be  available  for 
processing  of  events  in  the  visual  field.  Resource*  are  evenly  distributed 
with  a  low  density  during  divided  attention,  while  being  concentrated  with  a 
high  density  durlr^  focused  attention.  Thus,  the  density  of  resources  limits 
the  rate  and  extent  of  processing  of  events  vichin  the  focus  of  attention. 

It  Is  noteworthy  that  ERP  results  are  quire  consistent  with  the  resource 
metaphor.  The  amplitude  of  the  N100  has  been  found  to  be  largest  when 
elicited  by  events  In  attended  channels,  intermediate  in  amplitude  in 
divided  attention  conditions,  and  amalleat  when  elicited  by  unattended 
events  during  focused  attention  (Hink,  Van  Voorhls,  Hillyard  and  Smith, 

1977;  Okita,  1979;  Parasuraman,  1978,  1985).  Furthermore,  the  summed 
amplitude  of  the  N100*  remain*  constant  between  conditions  of  focused  and 
divided  attention  (hink  et  si.,  1977;  Van  Voorhls  and  Hillyard,  1977). 

Thus,  the  amplitude  of  the  N100  component  appears  to  reflect  the  density  of 
resources  allocated  to  events  as  well  as  the  fixed  quantity  of  resources 
available  for  processing. 

Although  the  examination  of  ERPs  within  the  dichotic 
listening/selective  looking  paradigm  has  produced  a  wealth  of  valuable 
information  concerning  the  mechanisms  underlying  focused  and  divided 
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attention,  research  in  this  paradigm  has  been  limited  to  the  Investigation 
of  intratask  attentional  processes.  Thus,  one  important  question  not 
addressed  within  this  research  program  is  the  degree  of  generalizability  of 
attentional  phenomena  to  multitask  environments.  The  issue  of  attentional 
tradeoffs  between,  rather  than  within,  tasks  has  been  addressed  in  the  dual 
task  paradigm.  Within  the  dual  task  paradigm  subjects  perform  two  tasks 
both  separately  and  together.  Dual  task  demands  are  manipulated  by  varying 
the  difficulty  and/or  the  processing  priorities  of  one  or  both  of  the  tasks. 
Although  dual  task  ERP  studies  have  concentrated  mainly  on  the  sensitivity 
of  P300  to  changing  attentional  demands,  several  noteworthy  findings  have 
been  obtained. 

When  subjects  are  instructed  to  maximize  their  performance  on  one  task 
and  the  difficulty  of  this  task  is  manipulated,  P300s  elicited  in  the 
secondary  task  decrease  systematically  with  increases  in  the  difficulty  of 
the  primary  task  (Isreal,  Chesney,  Wickens  and  Donchin,  1980:  Kramer, 

Wickens  and  Donchin,  1985;  Kramer,  Sirevaag  and  Braune,  1987;  Lindholm, 
Cheatham,  Koriath  and  Longbridge,  1984;  Natani  and  Comer,  1981).  In 
conditions  in  which  ERPs  are  also  recorded  in  the  primary  task,  the 
amplitude  of  the  ?300s  increase  with  increases  in  the  difficulty  of  this 
task  (Horst,  Munson  and  Ruchkin,  1984;  Kramer,  Wickens  and  Donchin,  1985; 
Sirevaag,  Kramer,  Coles  and  Donchin,  1984;  Wickens,  Kramer,  Vanasse  and 
Donchin,  1983).  The  reciprocal  relationship  between  P300s  elicited  by 
primary  and  secondary  task  stimuli  is  consistent  with  the  attentional 
tradeoffs  presumed  to  underlie  dual  task  performance  decrements  (Freidman 
and  Poison,  1981;  Kahneman,  1973;  Navon  and  Gopher,  1979;  Sanders,  1979; 
Wickens,  1980,  1984).  That  is,  attentional  resource  models  predict  that  as 
the  difficulty  of  one  task  is  increased,  additional  resources  are 
reallocated  to  that  task  in  order  to  maintain  performance,  thereby  depleting 
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the  supply  of  resources  that  could  have  been  used  in  the  processing  of  other 
tasks.  Resource  models  also  predict  that  attentional  tradeoffs  should  occur 
when  subjects  shift  their  emphasis  from  one  task  to  another,  even  in  the 
absence  of  difficulty  manipulations.  P300s  have  also  been  found  to  mimic 
the  shifts  in  attentional  resources  expected  with  changes  in  the  processing 
priorities  of  two  tasks  (Hoffman,  Houck,  MacMillian,  Simons  and  Oatman, 

1985;  Kramer  and  Strayer,  1987).  Thus,  the  P300  appears  to  provide  a 
measure  of  attentional  tradeoffs  that  can  only  be  Inferred  from  more 
traditional  performance  measures. 

The  ERP  results  obtained  in  the  dichotic  listening/selective  looking 
and  dual  task  paradigms  have  provided  converging  evidence  for  theories  of 
attention  as  well  as  new  insights  into  the  mechanisms  underlying  attentional 
phenomena.  However,  there  are  a  number  of  questions  that  remain  to  be 
addressed,  particularly  in  regards  to  the  interaction  of  the  varieties  of 
attention  investigated  in  these  paradigms.  In  the  present  study  we  will 
examine  a  number  of  questions  relevant  to  the  effects  of  intertask  demands 
on  the  processing  of  stimuli  in  the  visual  field. 

One  such  question  concerns  the  level  of  processing  at  which  attentional 
resources  are  transferred  from  one  task  to  another  during  dual  task 
performance.  Although  NIOOs  have  been  found  to  decrease  in  amplitude  from 
focused  to  divided  attention  conditions  within  a  single  task,  it  is  unknown 
whether  this  effect  generalizes  from  single  task  to  dual  task  conditions. 

At  a  finer  level  of  analysis,  it  becomes  relevant  to  ask  if  and  how  the 
visual-spatial  N100  attention  effect  is  influenced  by  intertask  demands.  Is 
it  the  case  that  the  amplitude  of  the  NIOOs  elicited  by  both  attended  and 
ignored  events  are  reduced  or  does  the  reduction  reflect  a  selective 
focusing  of  attention  (eg.  diminished  ignored  versus  attended  NIOOs)?  This 
same  question  is  relevant  for  the  P300  component.  Previous  investigations 
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of  the  sensitivity  of  the  P300  to  changes  in  intertask  demands  have  focused 
solely  on  the  P300s  elicited  by  task  relevant  events.  In  the  present  study 
we  examine  the  degree  to  which  the  sensitivity  of  the  P300  to  intralocation 
and  interlocation  selective  processes  is  influenced  by  demands  Imposed  from 
another  task. 

A  second  important  question  concerns  the  effects  of  intertask  demands 
on  visual-spatial  attentional  gradients  A  similar  issue  has  been  addressed 
in  performance  based  paradigms.  In  these  studies  subjects  are  required  to 
selectively  respond  Lo  brief  presentations  of  items  in  the  periphery  of  the 
visual  field  while  also  performing  a  foveal  task.  In  a  number  of  such 
studies,  imposing  a  foveal  task  and  increasing  its  difficulty  has  led  to  a  - 
decrement  in  the  processing  of  information  in  the  periphery,  particularly  at 
the  most  distant  locations  (Ikeda  and  Takcuchi,  1975;  Williams,  1985).  This 
phenomenon  has  been  termed  cognitive  tunnel  vision  (Mackworth,  1965). 
Although  these  findings  suggest  the  intriguing  possibility  that  the 
functional  visual  field  constricts  with  increased  foveal  task  demands, 
several  aspects  of  the  research  may  limit  the  generalizability  of  this 
phenomenon. 

First,  cognitive  tunnel  vision  has  only  been  obtained  in  a  subset  of 
the  studies  examining  the  effects  of  foveal  task  load  on  peripheral 
processing.  In  many  cases  all  stimuli  in  the  periphery  were  more  poorly 
processed  with  increased  foveal  task  demands  (Antes  and  Edwards,  1973; 
Holmes,  Cohen,  Haith  and  Morrison,  1977;  Voss,  1981;  Williams,  1982). 

Second,  the  effects  have  been  found  only  in  situations  in  which  the  foveal 
and  peripheral  stimuli  were  presented  very  briefly.  Third,  since  only 
measures  of  accuracy  or  RT  were  collected,  the  level  of  processing  at  which 
this  effect  occurred  could  not  be  assessed.  In  the  present  study  we  will 
examine  the  effects  of  foveal  task  load  on  the  selective  processing  of  items 
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in  the  periphery  by  increasing  the  difficulty  of  a  foveally  presented 
continuous  monitoring  task  and  examining  the  ERPs  elicited  by  stimuli 
presented  at  different  retinal  eccentricities. 

Method 

Subjects  Ten  dextral  graduate  students  (five  male)  were  paid  for  their 
participation  in  the  study.  All  of  the  subjects  were  between  the  ages  of  20 
and  26  and  had  normal  or  corrected  to  normal  vision.  None  of  the  subjects 
had  any  prior  experience  with  the  tasks  employed  in  the  experiment. 

Triangle  Monitoring  and  Arrow  Discrimination  Tasks  Subjects  performed 
two  tasks  both  separately  and  together.  The  tasks  are  graphically  depicted 
in  Figure  1.  The  foveal  task  required  subjects  to  monitor  a  dynamic  process 
for  occassional  failures.  The  multi-attribute  process  was  represented  by  a 
triangle  which  changed  shape  as  a  function  of  three  system  variables.  The 
triangle  was  displayed  at  the  center  of  a  CRT  positioned  88  cm  directly  in 
front  of  the  subject.  The  difficulty  of  the  monitoring  task  was  varied  by 
manipulating  the  complexity  of  the  rule  for  detecting  a  system  failure.  Two 
independent  sums  of  six  sine  waves  were  used  to  update  the  length  of 
segments  II  and  12.  Note  that  the  sum  of  these  segments  determined  the 
length  of  the  base  of  the  triangle. 

In  the  correlated  monitoring  condition,  changes  in  the  length  of 
segment  0  (which  determined  the  height  of  the  triangle)  were  perfectly 
correlated  with  changes  in  segments  II  and  12.  During  a  system  failure,  the 
length  of  segment  0  was  driven  by  a  third  sum  of  sines  function, 
uncorrelated  with  the  changes  in  segments  II  and  12.  Thus,  a  system  failure 
was  said  to  occur  when  the  relationship  between  the  height  and  the  base  of 
the  triangle  changed  from  a  correlated  to  an  uncorrelated  state.  The  change 
from  a  normal  to  an  abnormal  system  took  place  in  the  form  of  a  ramp 
function  over  a  1200  msec  epoch. 
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During  the  uncorrelated  monitoring  condition,  the  contingencies  were 
reversed.  In  this  condition,  the  subjects  were  to  detect  when  the  system 
changed  from  the  uncorrelated  state  to  one  in  which  the  height  of  the 
triangle  depended  upon  changes  in  the  II  and  12  segments.  The  horizontal 
base  (segment  II  and  12)  of  the  triangle  varied  from  0.3  to  2.6  degrees  of 
visual  angle.  The  vertical  height  of  the  triangle  (segment  0)  varied  from 
0.3  to  2.0  degrees  of  visual  angle.  The  triangle  dimensions  were  updated 
every  50  msec. 

The  number  of  failures  that  could  occur  on  any  block  of  trials  varied 
from  10  to  15.  The  failures  occurred  with  a  stimulus  onset  asynchrony  (S0A  - 
the  time  between  the  end  of  one  failure  and  the  beginning  of  the  next) 
which  varied  randomly  between  5  and  50  sec.  Following  the  onset  of  a  system 
failure,  subjects  were  given  an  interval  of  6000  msec  to  signal  the 
detection  of  the  failure  by  depressing  a  button  with  the  thumb  of  one  hand. 
The  assignment  of  response  hand  to  the  detection  of  triangle  failures  was 
counterbalanced  across  subjects.  Responses  occuring  within  this  6000  msec 
interval  were  scored  as  hits  and  the  reaction  time  (RT)  was  recorded.  If 
the  subject  failed  to  respond  within  this  interval,  a  miss  was  scored  and 
the  parameters  of  the  algorithm  controlling  the  triangle  were  reset  to  the 
normal  system  state.  Button  press  responses  that  occured  when  the  triangle 
was  not  in  a  "failure"  state  were  scored  as  false  alarms  (FAs).  Based  upon 
previous  research  it  was  expected  that  the  detection  of  failures  in  the 

t 

uncorrelated  system  would  be  substantially  slower  and  more  error  prone  than 
in  the  correlated  system  (Casey  and  Wickens,  1986). 

In  the  parafoveal  task  subjects  were  required  to  selectively  attend  to 
either  the  right  or  left  visual  field  and  depress  a  response  button  whenever 
a  prespecified  target,  an  upward  or  downward  pointing  arrow,  was  presented. 
During  a  given  block  of  trials,  arrows  pointing  in  the  target  direction, 
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regardless  of  visual  field,  occurred  with  a  probability  of  .20,  arrows 
pointing  in  the  non-target  direction  occurred  with  a  probability  of  .80. 

The  probability  of  an  arrow  occurring  in  either  the  right  or  left  visual 
field  was  .50.  Thus,  the  number  of  trials  actually  requiring  a  response 
(those  that  pointed  in  the  target  direction  in  the  appropriate  visual  field) 
constituted  10Z  of  the  trials  during  a  given  block.  Subjects'  signaled  the 
detection  of  a  target  by  depressing  a  button  with  whichever  thumb  was  not 
used  to  respond  to  the  triangle  monitoring  task. 

The  arrows  were  presented  at  three  different  retinal  eccentricities 
(defined  in  terms  of  deviations  in  degrees  of  visual  angle  from  the  center 
of  the  screen);  ''close”  (1.4  degrees),  "middle"  (4.5  degrees),  and  "far" 
(9.1  degrees).  During  all  conditions  in  which  the  arrow  discrimination  task 
was  performed,  a  total  of  200  arrow  stimuli  were  presented  with  an 
inter-stimulus  interval  (ISI)  that  varied  between  1.2  and  1.5  sec.  Each 
arrow  was  presented  for  a  50  msec  duration  and  subtended  a  visual  angle  of 
1.3  degrees.  RT,  hits,  misses,  and  FAa  were  collected  for  each  experimental 
trial.  Failures  in  the  triangle  monitoring  task  and  targets  in  the  arrow 
discrimination  task  were  offset  by  at  least  1500  msec  in  the  dual  task 
conditions. 

Before  each  block  of  trials  subjects  were  told  which  visual  field  to 
attend.  Special  emphasis  was  placed  on  the  requirement  that  the  eyes  remain 
fixated  at  the  center  of  the  screen.  Trials  containing  eye  movements  and 
blinks  were  not  included  in  the  analysis.  Each  subject  performed  an  equal 
number  of  attend  left  and  attend  right  blocks.  The  retinal  eccentricity 
variable  was  similarly  blocked.  Thus,  stimuli  occurred  at  the  same  retinal 
eccentricity  within  the  attend  and  Ignore  visual  fields  durlr\g  a  single 
block  of  trials.  The  direction  of  the  target  arrow  was  counterbalanced 
across  subjects  such  that  an  individual  subject  always  responded  to  target 
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arrows  pointing  in  the  same  direction.  This  combination  of  variables 
produced  a  total  of  6  different  single  task  arrow  discrimination  conditions 
(2  visual  fields  x  3  retinal  eccentricities).  When  combined  with  the  two 
levels  of  triangle  monitoring  (correlated  and  uncorrelated)  a  total  of  12 
dual  task  conditions  resulted. 


Insert  Figure  1  About  Here 


ERP  Recording  Electroencephalographic  (EEG)  data  were  recorded  from 
four  midline  sites  (Fz,  Cz,  Pz,  and  Oz).  In  addition,  activity  at  two 
non-standard  lateral  occipital  sices  was  also  recorded.  These  lateral  leads 
were  located  2.5  cm  to  the  left  (designated  01)  and  2.5  cm  to  the  right 
(designated  02)  of  the  Oz  position.  All  active  EEG  electrodes  were 
referenced  to  linked  mastoids.  Two  ground  electrodes  were  positioned  on  the 
left  side  of  the  forehead. 

Electrooculographic  (EOG)  activity  in  the  vertical  plane  was  monitored 
by  electrodes  placed  above  and  below  the  subject's  left  eye.  Horizontal  EOG 
activity  was  monitored  by  placing  an  electrode  at  the  outer  canthus  of  each 
eye.  Beckman  Biopotential  electrodes  filled  with  Grass  paste  were  utilized 
at  all  EEG,  EOG,  reference,  and  ground  sices.  Electrode  impedances  were 
maintained  below  10  kohms. 

The  EEG  and  EOG  were  amplified  with  Van  Gogh  Model  50000  amplifiers 
(time  constant  10  s  and  upper-half  amplitude  of  70  Hz).  EEG  and  EOG 
channels  were  sampled  every  5  msec,  during  a  1000  msec  epoch  beginning  100 
msec  prior  to  the  onset  of  the  stimuli  in  the  arrow  discrimination  task. 
Single  trials  containing  blinks  or  horizontal  eye  movements  were  rejected 
prior  to  statistical  analysis. 

Stimulus  Generation  and  Data  Collection  Stimulus  presentation  and  data 
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acquisition  were  governed  by  a  DEC  PDP  11/73  human  performance  - 
electrophysiological  laboratory  computer  (Heffley,  Foote,  Mui,  and  Donchin, 
1985)  interfaced  with  an  Imlac  graphics  processor.  Single  trial  and  average 
EEC  and  EOG  data  were  monitored  on-line  by  use  of  a  GT-44  display.  The 
digitized  data  from  the  single  trials  were  stored  on  magnetic  tape  for 
off-line  analysis. 

Procedure  The  experimental  conditions  were  presented  in  a  blocked 
fashion.  Each  block  lasted  approximately  5  min  and  required  either  single 
task  triangle  monitoring,  single  task  arrow  discriminations,  or  dual  task 
performance. 

Subjects  participated  in  three  experimental  sessions  each  of  which  took 
place  on  separate  days.  The  first  session  was  considered  practice.  The 
data  from  this  session  were  not  analyzed.  Each  subject  performed  eleven 
blocks  of  trials  in  the  practice  session.  Two  of  these  blocks  did  not 
require  responses,  subjects  merely  monitored  the  movements  of  the  triangle. 
When  a  failure  occurred  the  word  "FAILURE"  was  presented  on  the  CRT 
immediately  above  the  triangle.  These  conditions  served  to  familiarize  the 
subject  with  the  correlated  and  uncorrelated  systems.  The  remaining  blocks 
required  the  subjects  to  respond  to  the  detection  of  system  failures.  In 
some  of  the  blocks  subjects  performed  the  triangle  task  alone,  while  in 
others  they  were  required  to  monitor  the  triangle  and  respond  to  the  stimuli 
in  the  arrow  discrimination  task. 

ERPs,  RTs  and  accuracy  measures  were  recorded  during  the  subsequent  two 
sessions.  During  a  given  session  subjects  performed  only  one  version  of  the 
monitoring  task  (correlated  or  uncorrelated).  Hie  presentation  order  was 
counterbalanced  across  subjects.  Subjects  performed  21  blocks  of  trials  in 
each  of  the  sessions.  Three  of  these  were  single  task  monitoring  (either 
correlated  or  uncorrelated).  Six  single  task  arrow  discrimination  blocks 
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were  also  completed  in  each  session  (2  visual  fields  x  3  retinal 
eccentricities).  Two  replications  of  each  of  the  six  dual  task  blocks,  the 
monitoring  task  paired  with  each  of  the  six  arrow  discrimination  conditions, 
completed  the  21  blocks  in  each  of  the  experimental  sessions.  The 
presentation  order  of  single  and  dual  tasks  was  counterbalanced  across 
subjects  and  sessions. 

Data  Analysis  Three  ERP  components  were  analyzed  in  this  experiment 
(N160,  N190,  and  P300).  For  the  two  negative  components,  amplitude  and 
latency  estimates  were  obtained  by  submitting  the  average  data  from  each 
electrode  to  an  algorithm  which  selected  the  most  negative  peak  relative  to 
the  mean  value  of  the  pre-stimulus  baseline  in  a  latency  window  derived  from 
visual  inspection  of  the  grand  mean  waveform  (100  to  300  msec 
post-stimulus) . 

Because  latency  variability  in  single  trial  P300s  can  contribute  to 
amplitude  differences  in  average  waveforms,  estimates  of  P300  amplitude  and 
latency  were  derived  from  the  single  trials.  These  estimates  were  derived 
using  two  different  techniques.  First,  the  waveforms  from  each  of  the 
electrodes  were  submitted  to  a  covariance  algorithm.  This  procedure 
computed  the  covariance  of  each  waveform  with  a  cosinusoidal  waveform  within 
a  moving  500  msec  window  beginning  300  msec  post-stimulus  and  ending  at  800 
msec  post-stimulus.  P300  latency  was  defined  as  the  midpoint  in  the  epoch 
which  provided  the  maximum  covariance,  and  P300  amplitude  was  estimated  from 
the  magnitude  of  the  maximum  covariance.  Trials  for  which  the  maximum 
wavef orm/template  correlation  coefficient  was  less  than  .35  were  discarded 
from  the  analysis.  The  second  signal  extraction  technique  employed  in  the 
estimation  of  P300  amplitude  and  latency  was  a  base  to  peak  measure  on  the 
single  trials.  The  largest  positive  peak  was  selected  in  a  300  to  800  msec 
post-stimulus  window.  Since  a  comparison  of  the  measures  obtained  via  cross 
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correlation  and  base  to  peak  did  not  reveal  any  significant  differences 
between  the  pattern  of  results,  the  F  ratios  will  be  reported  for  the  base 
to  peak  measures.  In  order  to  protect  against  positive  biases  in  the  F 
tests  the  significance  levels  were  computed  with  conservative  degrees  of 
freedom  (Greenhouse  and  Geisser,  1959). 

Results 

Performance  measures 


Triangle  Monitoring  Task 

RT  was  defined  as  the  interval  between  the  beginning  of  the  "failure" 
and  the  subjects'  keypress.  Consistent  with  our  predictions,  subjects  were 
more  successful  in  detecting  failures  when  monitoring  the  triangle  in  which 
the  two  base  legs  were  correlated  with  the  height  than  when  monitoring  the 
triangle  in  which  the  base  legs  and  height  varied  independently.  The  data 
presented  in  Table  1  represents  the  average  performance  measures  obtained  in 
the  dual  task  triangle  monitoring  conditions.  Two  separate  ANOVA's  were 
performed  on  the  triangle  monitoring  data;  a  comparison  of  single  and  dual 
task  performance  and  a  separate  dual  task  ANOVA. 


Insert  Table  1  About  Here 


In  our  instructions  to  the  subjects  we  indicated  that  they  should 
protect  their  triangle  monitoring  performance  in  the  dual  task  conditions  at 
the  expense  of  their  performance  in  the  arrow  discrimination  task.  Thus,  it 
was  emphasized  that  the  triangle  monitoring  task  was  of  primary  importance 
and  that  their  performance  in  this  task  should  be  maintained  at  single  task 
levels.  In  order  to  interpret  secondary  task  decrements  as  due  to  the 
allocation  of  scarce  resources,  rather  than  simply  due  to  a  tradeoff  in 
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performance  in  one  task  against  the  other,  primary  task  performance  must 
remain  unchanged  (Wickens,  1984).  A  comparison  between  single  and  dual  task 
triangle  monitoring  measures  Indicated  that  subjects  did  protect  their 
performance  in  the  primary  task.  RT,  hits,  misses  and  FA's  did  not  differ 
in  the  single  and  dual  task  monitoring  conditions  (all  p's>.15).  1 

The  performance  data  obtained  in  the  dual  task  conditions  were  analyzed 
in  a  series  of  three-way  repeated  measures  ANOVA's  (10  subjects  x  2  tasks  x 
3  retinal  eccentricities).  As  predicted,  subjects  were  both  faster 
(F( 1 ,9)— 179.2  ,  p<.01 )  and  more  accurate  (F( 1 ,9)-13.7,  27.8,  19.7,  p<.01  for 
hits,  misses  and  FA's  respectively)  when  monitoring  for  failures  in  the 
correlated  than  in  the  uncorrelated  conditions.  This  same  pattern  of 
results  was  obtained  in  the  single  task  triangle  monitoring  conditions.  The 
retinal  eccentricity  of  the  arrow  stimuli  did  not  significantly  influence 
any  of  the  performance  measures.  Thus,  the  pattern  of  results  obtained  in 
the  monitoring  task  indicated  that  (a)  subjects  were  capable  of  maintaining 
their  single  task  performance  in  the  dual  task  conditions  and,  (b)  that  our 
manipulation  of  the  degree  of  correlation  between  triangle  dimensions  was 
successful  in  influencing  subjects'  performance  on  the  task. 

Arrow  Discrimination  Task 

Table  2  presents  the  average  RT  data  for  both  the  single  and  dual  task 
conditions.  Several  trends  in  the  data  are  noteworthy.  First,  RT  increased 
with  the  Introduction  of  the  triangle  monitoring  task.  Furthermore,  this 
increase  in  RT  was  larger  with  the  more  difficult  version  of  the  monitoring 
task.  Second,  RT  increased  as  the  arrows  were  presented  further  in  the 
periphery,  especially  in  the  single  task  condition.  The  results  of  a  three- 
way  repeated  measures  ANOVA  (10  subjects  x  3  task  conditions  x  3  retinal 
eccentricities)  confirmed  these  observations.  Subjects  were  significantly 
faster  when  performing  in  the  single  task  than  they  were  in  the  dual  task 
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conditions  (F(2 , 18)-37. 3 ,  <.01).  RT  also  increased  with  the  retinal 
eccentricity  of  the  arrow  stimuli  ( F( 2 , 18 )™25 . 3 ,  p<.Ql).  The  interaction 
between  task  condition  and  retinal  eccentricity  was  also  significant 
(F(4,36)»5.9,  p<.05)  indicating  that  the  increase  in  RT  was  larger  from  the 
single  cask  condition  to  the  correlated  dual  task  conditions  than  it  was 
from  the  correlated  to  the  uncorrelated  conditions.  Post-hoc  comparisons 
indicated  that  the  dual-task  conditions  were  reliably  different  at  the  close 
and  middle  arrow  positions. 2 


Insert  Table  2  About  Here 


A  similar  pattern  of  results  was  obtained  for  misses.  Subjects  failed 
to  respond  to  che  carget  arrows  more  fequently  in  the  correlated  dual  task 
conditions  than  in  the  single  task  conditions.  Misses  also  increased  from 
the  correlated  to  the  uncorrelated  dual  task  conditions  (F(2,18)-11.8, 
p<.01).  Number  of  misses  increased  as  a  function  of  retinal  eccentricity, 
with  more  misses  occurring  farther  in  the  periphery  (F(2,18)"6.5,  <.05). 

FA's  also  increased  as  a  function  of  the  retinal  eccentricity  of  the  arrow 
stimuli  (F(2, 18)-6.6,  P<.05). 

The  pattern  of  results  obtained  for  the  arrow  discrimination  task 
suggests  that  the  demands  Imposed  upon  the  subjects  by  the  foveally 
presented  triangle  monitoring  task  influenced  subjects'  performance  in  the 
peripheral  task.  RT  increased  while  accuracy  decreased  with  che 
Introduction  of  the  monitoring  task.  Furthermore,  the  increase  in  the 
difficulty  of  the  monitoring  task  produced  additional  decrements  in  the 
subjects'  performance  on  the  arrow  discrimination  task.  These  results  when 
viewed  in  conjunction  with  the  performance  measures  obtained  from  the 
triangle  monitoring  task,  argue  that  subjects  were  compensating  for 
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increases  in  primary  cask  demands  by  reducing  their  accuracy  and  increasing 
their  response  time  in  the  secondary  task. 

Although  the  performance  measures  provide  valuable  information 
concerning  the  resource  allocation  policies  adopted  by  the  subjects  to  cope 
with  the  increases  in  task  demands,  they  do  not  clarify  the  level  within  the 
information  processing  system  at  which  these  resources  were  exchanged.  The 
performance  measures  are  also  silent  on  the  degree  to  which  processing 
resources  are  withdrawn  from  different  elements  in  the  visual  field.  Thus, 
it  is  conceviable  chat  the  processing  of  all  peripherial  stimuli  may  be 
reduced,  the  processing  of  stimuli  in  the  unattended  spatial  locations  may 
be  reduced,  or  the  processing  of  stimuli  not  sharing  all  of  the  features  of 
the  target  stimuli  may  be  curtailed.  It  is  for  answers  to  these  questions 
that  we  turn  to  an  analysis  of  the  ERPs. 

Event-Related  Brain  Potentials 

The  ERP  section  is  organized  around  the  issues  of  intra-  and  intertask 
attention  as  well  as  the  components  of  the  ERP  that  have  been  proposed  to 
index  different  levels  of  attentional  analysis.  We  begin  by  describing  the 
effect  of  the  experimental  manipulations  on  the  amplitude  and  latency  of  the 
visual  N100  component.  Since  the  N100  recorded  at  occipital  sites  has 
previously  been'  found  to  differ  in  its  sensitivity  to  attentional 
manipulations  from  NIOOs  recorded  at  frontal,  central  and  parietal  sites, 
the  NIOOs  recorded  from  the  occipital  leads  will  be  analyzed  separately 
(Harter  and  Aine,  1984;  Mangun  and  Hillyard,  1987;  Rugg,  Milner,  Lines  and 
Phalp,  1987;  Van  Voorhis  and  Hillyard,  1977).  These  components  will 
henceforth  be  labelled  according  to  their  average  latency,  N160  for  the 
component  recorded  at  the  frontal,  central,  and  parietal  sites  and  N190  for 
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the  occipitally  recorded  component.  The  amplitude  and  latency  of  the  P300 
component  will  be  characterized  in  terms  of  their  sensitivity  to  changes  in 
processing  demands  and  selective  attention. 

Preliminary  analysis  of  the  effects  of  visual  field  (attend  left  or 
right)  on  the  CRP  components  did  not  produce  any  significant  effects. 
Therefore,  the  ERPs  elicited  by  the  attended  arrows  in  the  left  and  right 
visual  fields  were  averaged  together  prior  to  further  analysis.  The  same 
procedure  was  followed  for  the  averaging  of  the  unattended  stimuli.  For  the 
lateralized  electrode  placements  (01  and  02)  preliminary  analysis  indicated 
chat  electrode  site  did  not  interact  with  other  experimental  variables. 
Therefore,  since  attentlonal  effects  have  previously  been  shown  to  differ 
between  ipsilateral  and  contralateral  configurations,  averages  were  produced 
for  the  attended  and  unattended  stimuli  in  the  contralateral  and  ipsilateral 
fields. 


Insert  Figures  2,3,4  &  5  About  Here 


Early  Selective  Attention  and  Processing  Demands  -  N100  Components 

The  ERPs  recorded  in  single  and  dual  task  conditions  for  standards  at 
the  Fz,  Cz  and  Pz  electrode  sites  and  targets  at  the  Pz  site  are  presented 
in  figures  2  through  5,  respectively.  As  can  be  seen  from  the  figures,  the 
ERPs  were  composed  of  a  number  of  positive  and  negative  peaks.  Of 
particular  interest  in  the  present  study  are  the  early  negativity  peaking  at 
160  msec  post-stimulus,  and  the  late  positivity  which  is  largest  for  the 
target  stimuli  and  peaks  at  approximately  450  msec  post-stimulus. 

N160  Component  The  N160  component  was  quantified  by  computing 


base-peak  measures  of  amplitude  and  latency  for  each  of  the  single  and  dual 
task  conditions.  These  values  were  then  submitted  to  five-way  repeated 
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measures  ANOVAs  (10  subjects  x  3  task  levels  x  3  retinal  eccentricities  x  2 
attention  conditions  x  2  stimulus  types)  for  Fz,  Cz  and  Pz  electrode  sites. 

As  can  be  seen  from  figures  2  through  3,  the  N160s  elicited  by  stimuli 
in  the  attended  field  were  larger  than  the  N160s  elicited  by  stimuli  in  the 
unattended  field.  The  effect  of  direction  of  attention  was  significant  at 
both  the  Cz  (F( 1 ,9)“5.9,  p<.05)  and  Pz  sites  (F( 1 ,9)«10.7 ,  p<.01)  while 
being  marginally  significant  at  Fz  (F(l,9)«4.7,  p<.06).  A  significant 
two-way  interaction  between  attention  and  retinal  eccentricity  was  also 
obtained  at  the  Cz  (F(2,18)-5.5,  p<.05)  and  Pz  sites  (F(2,18)“6.9,  p<«05). 
The  direction  of  the  interaction  is  apparent  from  a  comparison  of  the 
amplitude  measures  presented  in  Table  3.  Post-hoc  comparisons  indicated 
that  the  attention  effects  were  significant  for  the  middle  and  far  retinal 
eccentricities  but  not  for  the  close  position.  Furthermore,  the  increase  in 
the  amplitude  of  the  N160  as  a  function  of  eccentricity  was  significant  only 
for  the  attended  field  stimuli.  This  pattern  of  results  is  consistent  with 
Hillyard  and  Munte  (1984)  who  found  that  when  stimuli  were  presented  in 
close  proximity,  N160s  did  not  differentiate  between  attended  and  unattended 
locations.  The  results  also  appear  to  suggest  that  the  increase  in  the 
attention  effect  with  retinal  eccentricity  is  due  to  an  increase  in  the 
N160s  elicited  by  the  attended  field  stimuli  rather  than  a  decrease  in  the 
N1608  elicited  by  stimuli  in  the  unattended  field. 


Insert  Table  3  About  Here 


Although  it  appears  upon  inspection  of  the  grand  average  waveforms  that 
the  absolute  amplitude  of  the  N160s  may  have  declined  from  single  to  dual 
cask  conditions  this  effect  was  not  significant  at  any  of  the  electrodes 
(all  p's  >.25).  Furthermore,  task  level  did  not  interact  with  the  attention 
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effect.  Thus,  the  increased  demands  imposed  upon  subjects  by  the 
introduction  of  the  monitoring  task  and  the  increase  in  its  difficulty  was 
not  reflected  in  the  level  of  attentional  selection  indexed  by  the  N160 
component.  This  pattern  of  results  is  particularly  interesting  when 
considered  in  light  of  other  findings  which  have  indicated  that  both 
auditory  (Hink,  Van  Voorhis,  Hillyard  and  Smith,  1977;  Parasuraman,  1978) 
and  visual  NIOO's  (Parasuraman,  1985;  Van  Voorhis  and  Hillyard,  1977) 
decrease  in  amplitude  when  subjects  are  required  to  divide  their  attention 
between  two  channels  of  information  instead  of  focusing  on  a  single  channel. 
In  the  dual  task  conditions  in  the  present  experiment,  however,  subjects 
divided  their  attention  between  two  tasks  rather  than  two  channels  of 
information  in  a  single  cask.  Thus,  the  N160  may  reflect  the  division  of 
attention  among  events  within  a  task  but  may  be  insensitive  to  inter-task 
demands. 

A  significant  main  effect  for  stimulus  type  was  obtained  at  the  Fz 
(F(l,9)«12.3,  p<.01)  and  Cz  sites  (F(l,9)-7.3,  p<.05).  The  larger  N160s 
elicited  by  the  targets  may  be  due  to  the  probability  difference  between  the 
two  classes  of  stimuli.  This,  in  turn,  would  lead  to  a  longer  1SI  between 
targets  than  standards  thereby  allowing  less  recovery  for  the  standards 
(Plcton,  Woods  and  Proulx,  1978;  Woods  and  Courchesne,  1986;  Woods, 
Courchesne,  Hillyard  and  Galambos,  1980).  Another  possibility  is  that  the 
smaller  number  of  trials  in  the  target  average  may  have  resulted  in  greater 
amplitude  variability  for  the  targets  than  the  standards.  It  is  interesting 
to  note  that  although  there  was  a  significant  main  effect  for  stimulus  type, 
the  lack  of  an  interaction  of  stimulus  type  with  other  experimental 
variables  indicates  that  the  attentional  manipulations  had  the  same  effect 
on  the  targets  and  the  standards. 

A  small  Increase  in  N160  latency  was  observed  from  the  frontal  to  the 
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parietal  site  (160,  165,  169  msec  for  Fz,  Cz  and  Pz;  respectively).  The  only 
significant  effect  for  N160  latency  was  an  interaction  between  task  level 
and  attention  condition  (F(2 , 18)«7 .0,  p<.05)  at  Fz.  Post-hoc  tests  revealed 
that  the  N160s  elicited  by  ignored  stimuli  in  the  difficult  dual  task 
conditions  were  significantly  longer  (171  versus  158  msec)  than  the  N160s 
elicited  by  the  attended  and  ignored  stimuli  in  the  other  single  and  dual 
task  conditions. 


Insert  Figures  6,  7  &  8  About  Here 

N190  Component  The  N190  component  was  quantified  by  computing  the 
base  to  peak  measures  of  amplitude  and  latency  for  each  of  the  single  and 
dual  cask  conditions  for  Oz  as  well  as  the  ipsllateral  and  contralateral 
conf igurations.  These  values  were  then  submitted  to  a  six-way  repeated 
measures  ANOVA  (10  subjects  x  3  task  levels  x  3  retinal  eccentricities  x  2 
attention  conditions  x  2  stimulus  types  x  3  electrode  sites). 

Consistent  with  the  N160  findings,  a  significant  main  effect  was 
obtained  for  attention  (F(l ,9)"11.0,  p<.01).  Stimuli  occurring  in  the 
attended  field  elicited  larger  N190s  than  stimuli  in  the  unattended  field. 
However,  the  main  effect  of  attention  is  qualified  by  several  two-  and 
three-way  interactions.  As  can  be  seen  from  inspection  of  figures  6,  7  and 
8,  the  attention  effect  is  largest  at  the  contralateral  sites,  somewhat 
smaller  at  Oz  and  absent  at  the  ipsllateral  sites  (F(2,18)*9.3,  p<.05).  The 
two-way  interaction  between  task  level  and  attention  was  also  significant 
(F(2,18)-6.1,  p<.05) .  Post-hoc  tests  indicated  that  the  introduction  of  the 
monitoring  task  decreased  both  the  overall  amplitudes  of  the  N190's  as  well 
as  the  difference  between  attended  and  ignored  stimuli.  These  effects  as 
well  as  the  interaction  among  electrode  site,  attention  and  task  level 
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(F(4,36)«10.2,  p<.05)  are  illustrated  by  the  grand  mean  N190  amplitude 
values  presented  in  Table  4.  Post-hoc  comparisons  indicated  that  although 
the  amplitude  of  the  N190s  elicited  by  the  attended  stimuli  were  reduced  for 
the  dual  task  conditions  at  the  contralateral  site,  the  difference  between 
attended  and  ignored  conditions  was  significant  for  both  the  single  and  dual 
tasks.  For  the  Oz  site,  the  N190s  were  reduced  in  amplitude  in  the  dual 
task  conditions  and  the  attention  effect  was  no  longer  significant.  For  the 
ipsilateral  site,  the  introduction  of  the  monitoring  task  significantly 
decreased  the  amplitude  of  the  N190a.  The  attention  effect  was 
nonsignificant  in  single  and  dual  task  conditions. 

This  pattern  of  results  would  appear  to  indicate  that  the  increased 
demands  Imposed  upon  the  subjects  in  the  dual  task  conditions  had  their 
major  impact  on  the  amplitude  of  the  N190s  elicited  by  the  attended  stimuli. 
A  secondary  effect  of  the  task  demands  was  to  diminish  the  differences 
between  the  attended  and  ignored  stimuli  for  the  contralateral  and  Oz  sites. 
Thus,  it  appears  that  the  selective  attention  effect  reflected  by  the  N190 
is  sensitive  to  inter  as  well  as  intra-task  demands.  These  results  contrast 
with  the  Insensitivity  of  the  N160  to  inter-task  demands  thereby  suggesting 
that  N160  and  N190  reflect  different  aspects  of  early  selective  attention. 


Insert  Table  4  About  Here 


The  interaction  among  retinal  eccentricity,  attention  and  electrode 
(F(2,18)*5.9,  <.05)  is  illustrated  in  Table  3.  Post-hoc  comparisons 
Indicated  that  the  differences  between  attended  and  ignored  conditions  were 
significant  for  the  middle  and  far  retinal  eccentricities  but  not  for  the 
close  position  at  the  Oz  site.  For  the  contralateral  site,  the  attention 
effect  was  significant  at  each  of  the  retinal  eccentricities.  There  was 
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also  a  significant  increase  in  the  amplitude  of  the  attended  N190s  and  a 
decrease  in  the  unattended  N190s  as  a  function  of  eccentricity  for  both  the 
Oz  and  contralateral  sites.  Neither  retinal  eccentricity  nor  attention  had 
significant  effects  on  the  N190s  elicited  at  the  ipsilateral  sites.  These 
results  are  interesting  in  that,  for  the  most  part,  they  are  quite 
consistent  with  the  changes  in  the  N160  component.  However,  there  is  one 
notable  exception.  The  increased  attention  effect  with  retinal  eccentricity 
seems  to  be  due  to  an  increase  in  the  N190s  to  attended  field  stimuli  as 
well  as  a  decrease  in  N190s  elicited  by  stimuli  in  the  unattended  field. 

This  contrasts  with  the  N160  results  in  which  the  unattended  N160s  did  not 
change  as  a  function  of  retinal  eccentricity.  Thus,  the  N190  may  reflect 
both  enhanced  processing  of  the  attended  stimuli  as  well  as  the  diminished 
processing  of  the  unattended  stimuli  while  the  N160  may  index  only  the 
former  process. 

Other  significant  effects  Included  electrode  (F(2,18)-6.9,  p<.05)  and 
stimulus  type  (F( 1 ,9)»9.6,  p<.05).  The  amplitude  of  the  N190  was  larger  at 
the  contralateral  than  the  ipsilateral  or  Oz  sites.  Targets  elicited  larger 
NI90s  than  standards.  The  stimulus  type  variable  did  not  interact  with  any 
of  the  other  experimental  factors. 

N190  latencies  were  significantly  longer  when  recorded  contralateral  to 
stimulus  presentation  than  they  were  when  recorded  at  the  ipsilateral  or  Oz 
sites  (F(2,18)_5.3,  p<.05).  The  latency  of  the  N190  component  increased  from 
the  close  and  middle  to  the  far  retinal  eccentricity  (F(2,18)*5.9,  p<.05; 

183,  182  and  202  msec,  respectively).  The  N190a  elicited  by  the  targets 
were  significantly  later  than  those  elicited  by  the  standards  (F( l ,9)"16.0 , 
p< . 01 ;  200  versus  177  msec).  The  interaction  between  stimulus  type  and  task, 
was  also  significant  (F(2,18)»7.6,  p<.05),  indicating  that  for  the  target 
stimuli  N190s  elicited  in  the  single  task  conditions  were  earlier  than  those 
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elicited  in  the  dual  task  conditions  (181,  207  and  203  msec).  The  pattern 
of  latencies  obtained  for  the  target  N190s  is  consistent  with  the  pattern  of 
RTs  recorded  in  the  arrow  discrimination  task  (see  Table  2).  Thus,  it 
appears  that  by  180  msec  post-stimulus,  evidence  of  Che  effects  of  foveal 
cask  load  on  the  processing  of  stimuli  in  the  periphery  has  been  provided  by 
both  the  amplitude  and  the  latency  of  the  N190.  It  is  interesting  to  note 
that  the  difference  between  single  and  dual  task  RTs  is  66  msec  while  the 
difference  in  N190  latencies  is  22  msec.  These  results  suggest  that 
although  some  of  the  increase  in  processing  time  can  be  accounted  for  by  the 
attentlonal  processes  reflected  in  the  N190,  an  additional  66Z  of  the 
temporal  cost  attributed  to  the  imposition  of  the  monitoring  task  occurs  in 
processes  following  the  discrimination  between  attended  and  unattended 
events. 

As  suggested  by  Rugg  et  al.  (1987),  an  important  control  for  subjects' 
eye  fixation  during  the  performance  of  the  arrow  discrimination  task  is  the 
magnitude  of  asymmetries  in  the  latencies  of  the  ERP  components  elicited  by 
attended  and  unattended  events.  If  subjects'  fixations  deviated  towards  the 
attended  field  it  would  be  expected  that  the  ipsilateral-contralateral 
differences  would  be  larger  for  the  unattended  events  due  to  the  greater 
distance  between  fixation  and  stimulus  presentation.  Thus,  if  subjects  had 
modified  their  direction  of  gaze  as  a  function  of  the  relevance  of  the 
stimuli  in  a  visual  field,  a  significant  interaction  between  attention  and 
electrode  would  be  expected.  This  interaction  was  not  significant  for  the 
N190  component. 


Insert  Figure  9  About  Here 
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Late  Selection  and  Processing  Demands  -  P300  Component 

The  late  positive  component  occurring  between  300  and  800  msec 
post-stimulus  was  larger  for  the  low  probability  targets  than  the  high 
probability  standards  (compare  figure  5  with  figure  4)  and  increased  in 
amplitude  from  the  frontal  to  the  parietal  site  with  decreases  in  amplitude 
from  Pz  to  Oz  (see  figure  9).  It  can  also  be  seen  in  figures  2  through  S 
chat  the  late  positivity  elicited  by  the  attended  events  is  larger  chan  the 
positivity  eliciced  by  the  stimuli  in  Che  unattended  visual  field.  The 
latency  range,  scalp  distribution  and  sensitivity  to  manipulations  of 
probability  and  attention  are  consistent  with  the  criteria  employed  in  the 
definition  of  the  P300  component  (Pritchard,  1981;  Sutton  and  Ruchltin, 

1984). 

The  P300  component  was  quantified  by  computing  base  to  peak  measures 
for  each  of  the  single  and  dual  task  conditions.  These  values  were  then 
submitted  to  six-way  repeated  measures  ANOVAs  (10  subjects  x  3  task  levels  x 
3  retinal  eccentricities  x  2  attention  conditions  x  2  stimulus  types  x 
electrodes).  Separate  ANOVAs  were  performed  for  the  midline  and  lateral 
electrodes  for  the  measures  of  P300  amplitude  and  latency. 

Insert  Tables  5  and  6  About  Here 


Midline  Sites  Consistent  with  the  waveforms  presented  in  figures  2 
through  8,  P300  amplitude  was  significantly  larger  for  the  attended  than  the 
unattended  field  stimuli  (F(l ,9)-106.8,  p<.01).  Of  more  importance, 
however,  was  the  interaction  between  stimulus  type  and  attention 
(F( l ,9)»62.4,  p< .01).  Post-hoc  tests  indicated  that  P300s  not  only 
discriminated  between  attended  and  unattended  visual  fields  but  also  between 
targets  and  standards  within  the  attended  field.  Thus,  while  the  early 
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negativities  reflected  processes  of  interlocation  selectivity,  the  P300 
provided  an  index  of  both  incer  and  intralocation  selectivity.  The 
interaction  of  this  effect  with  electrode  site  is  illustrated  in  Table  5 
(F(3,27)-12.6,  C.01).  The  differences  between  P300s  elicited  by  target  and 
standard  stimuli  in  the  attended  and  unattended  fields  is  largest  for  Pzt 
somewhat  smaller  for  the  Cz  and  Oz  sites,  and  smallest  for  Fz. 

Table  6  presents  the  grand  average  P300  amplitudes  for  the  attended  and 
ignored  stimuli  at  the  three  retinal  eccentricities  collapsed  over  targets 
and  standards  (F(2, 18)«5.8,  p<.05).  This  pattern  of  results  is  similar  to 
that  obtained  for  the  N160  and  N190  components.  Post-hoc  comparisons 
indicated  that  the  amplitude  of  the  attended  field  P300s  increased  with 
Increased  retinal  eccentricity  while  the  amplitudes  of  the  unattended  field 
P300s  diminished.  It  is  interesting  to  note,  however,  that  for  the  P300  the 
attention  effects  are  significant  at  each  of  the  eccentricities.  For  the 
N160  and  the  N190  at  Oz,  attention  effects  were  significant  only  for  the 
middle  and  far  positions.  These  results  suggest  that  although  attentional 
selectivity  between  the  close  positions  may  not  have  occurred  early  in 
processing,  evidence  for  such  a  discrimination  was  provided  by  400  msec 
post-stimulus. 

Consistent  with  previous  research  (Isreal  et  al.,  1980a;  Kramer  et  al., 
1983,  1985,  Kramer,  Sirevaag  and  Braune,  1987)  the  increased  processing 
demands  imposed  upon  the  subjects  by  the  introduction  of  the  monitoring  task 
and  the  increase  in  its  difficulty  from  the  correlated  to  the  independent 
conditions  had  a  significant  effect  on  the  amplitude  of  the  P300s  elicited 
in  the  arrow  discrimination  task  (F(2,18)-22.8,  p<.01).  This  decrement  in 
P300  amplitude  with  increased  processing  demands  is  illustrated  for  the 
attended  target  stimuli  in  Figure  10.  The  amplitude  of  the  P300s  elicited 
in  the  arrow  discrimination  task  decreased  with  the  introduction  of  the 
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monitoring  task.  Increases  in  the  difficulty  of  the  monitoring  task  served 
to  further  attenuate  the  amplitude  of  the  P300.  The  decrease  in  P300 
amplitude  with  processing  demands  was  largest  at  the  Pz  site  (F(6,54)*6.5, 
p<.05). 


Insert  Figure  10  and  Table  7  About  Here 


Of  particular  interest  in  the  present  study  was  the  influence  of 
intertask  demands  on  the  intratask  selective  attention  effect.  Table  7 
presents  the  mean  P300  amplitude  values  for  the  significant  attention  x  task 
interaction  (F(  2 , 18)*>5.9 ,  p<.05).  As  can  be  seen  from  the  table,  the 
imposition  of  the  monitoring  task  resulted  in  a  decrease  in  the  amplitude  of 
the  P300s  elicited  by  both  attended  and  unattended  field  stimuli.  However, 
increased  monitoring  demands  within  the  dual  task  conditions  resulted  in 
decreases  in  the  amplitude  of  the  P30Gs  elicited  by  the  attended  but  not  the 
unattended  field  stimuli.  Thus,  the  reduction  in  the  attention  effect  with 
increased  intertask  demands  was  more  than  twice  aa  large  for  the  attended 
field  P300s  than  it  was  for  the  unattended  field  P300s  (3.6  versus  1.4 
microvolts).  It  is  interesting  to  note  that  although  the  P300  is  largest  in 
amplitude  for  the  target  stimuli  in  the  attended  field,  the  reduced 
attention  effect  with  increased  task  demands  is  not  specific  to  the 
intralocation  distinction.  Instead  the  amplitude  of  P300s  elicited  by  all 
events  within  the  attended  field  are  reduced  with  increased  intertask 
demands. 

P300  latencies  were  longer  for  the  target  stimuli  than  they  were  for 
the  standards  (F( 1 ,9)"27. 5,  p<.01).  The  latency  of  the  P300  also  increased 
as  a  function  of  task  level  (F(2, 18)-7.9,  p<.05;  435,  466  amd  487  msec). 
Although  the  interaction  between  task  level  and  retinal  eccentricity  was 
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only  marginally  significant  when  using  conservative  degrees  of  freedom 
(F(4,36)-4.7,  p<.06),  the  pattern  of  P300  latencies  was  quite  similar  to 
that  obtained  for  RT.  The  differences  in  P300  latencies  as  a  function  of 
retinal  eccentricity  were  attenuated  in  the  dual  task  conditions. 

Lateral  Sites  Consistent  with  the  results  obtained  at  the  midline 
sites,  P300  was  larger  for  the  attended  than  the  unattended  field  events 
(F(l ,9)«55.4,  p<.01 ).  Furthermore,  the  attention  effect  interacted  with 
stimulus  type  (F( 1 ,9)"56.4,  p<.01),  indicating  that  P300s  were  largest  for 
the  target  stimuli  in  the  attended  visual  field  (see  Table  5).  Thus, 
consistent  with  the  present  results  for  the  midline  sites  as  well  as 
previous  findings,  P300  amplitude  discriminated  between  attended  and 
unattended  spatial  locations  as  well  as  target  and  nontarget  items  within 
the  attended  location. 

P300  amplitude  decreased  from  the  single  task  to  the  correlated  dual 
task  conditions  (F(2,18)-13.4,  p<.01).  Increases  in  dual  task  difficulty 
also  produced  a  decrement  in  P300  amplitude.  The  interaction  between 
retinal  eccentricity  and  attention  was  significant  (F(2,18)»8.5,  p<.05). 
Post-hoc  comparisons  indicated  that  the  amplitude  of  attended  field  P300s 
Increased  with  eccentricity  while  the  amplitude  of  the  unattended  field 
stimuli  diminished.  Finally,  consistent  with  results  obtained  by  other 
investigators  a  main  effect  of  recording  site  was  obtained  (Hillyard  and 
Munte,  1984;  Neville  and  Lawson,  1987).  P300s  were  slightly  larger  (.34 

microvolts)  when  recorded  ipsilateral  to  stimulus  presentation  (F( 1 ,9)“7.4 , 
p<.05) . 

P300  latencies  were  longer  for  the  targets  than  they  were  for  the 
standards  (F( 1 ,9)*65.6 ,  p<.01).  The  main  effect  of  retinal  eccentricity 
(F(2, 18)*6.9,  p<.05)  as  well  as  the  Interaction  between  eccentricity  and 
task  level  (F(4,36)»3.2,  p<.05)  attained  statistical  significance.  As  can  be 
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seen  in  Table  2,  the  latency  of  the  P300  increased  with  retinal  eccentricity 
to  a  greater  extent  in  the  single  than  in  the  dual  task  conditions. 

Finally,  P300s  elicited  by  stimuli  ipsilateral  to  the  recording  site  were 
signif icancly  eariler  than  those  obtained  at  contralateral  sites 
(F(l,9)«5.8,  p< .05;  443  versus  457  msec). 

Discussion 

A  major  goal  of  the  present  study  was  the  examination  of  the  effects  of 
intra-  and  intertask  demands  on  ERP  manifestations  of  attentional  processes. 
However,  in  order  to  investigate  the  sensitivity  of  ERP  components  to 
attentional  manipulations  we  were  first  required  to  provide  evidence  for 
resource  tradeoffs.  The  pattern  of  RT  and  accuracy  results  obtained  for  the 
arrow  discrimination  task  suggested  that  the  demands  imposed  upon  the 
subjects  by  the  foveally  presented  triangle  monitoring  task  influenced 
subjects'  performance  In  the  peripheral  task.  RT  increased  while  accuracy 
decreased  with  the  Introduction  of  the  monitoring  task.  Furthermore,  the 
increase  in  the  difficulty  of  the  monitoring  task  produced  additional 
decrements  in  the  subjects'  performance  on  the  arrow  discrimination  task. 
With  respect  to  resource  models  of  attentional  allocation,  these  results  can 
be  Interpreted  in  terms  of  the  withdrawal  of  resources  from  the  arrow 
discrimination  task  to  cope  with  the  processing  demands  of  the  monitoring 
task  (Freidman  and  Poison,  1981;  Navon  and  Gopher,  1979;  Sanders,  1979; 
Wickens,  1980).  Thus,  some  proportion  of  the  resources  that  had  been  used 
in  the  performance  of  the  arrow  discrimination  task  when  performed  alone 
were  reallocated  to  the  monitoring  task  in  the  dual  task  conditions. 
Additional  resources  needed  to  cope  with  increased  demands  were  withdrawn 
from  the  arrow  discrimination  task  when  subjects  were  required  to  perform 
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the  uncorrelated  version  of  the  monitoring  task. 

The  pattern  of  ERP  results  indicated  that  the  amplitude  of  the  N160 
recorded  at  the  central  and  parietal  sites  was  insensitive  to  intertask 
demands,  while  the  amplitude  of  the  occipital  N190  declined  with  both  the 
introduction  of  the  foveal  task  and  an  increase  in  its  difficulty.  These 
findings  are  interesting  in  several  respects.  First,  they  suggest  that, 
consistent  with  previous  arguements,  the  NlOQs  recorded  at  central-parietal 
and  occipical  sites  differ  in  their  sensitivity  to  attentional  manipulations 
both  within  and  across  tasks  (Harter  and  Aine,  1984;  Van  Voorhis  and 
Hillyard,  1977).  The  N160  appears  to  reflect  the  distribution  of  attention 
to  different  spatial  locations  within  a  task  while  the  N190  is  sensitive  to 
both  intra-  and  intertask  demands,  and  therefore  may  reflect  the 
distribution  of  general  purpose  perceptual  resources.  Second,  the  finding 
that  N190  decreased  as  a  function  of  intertask  demands  indicates  that 
resource  models  which  heretofor  have  been  silent  on  the  temporal  locus  of 
tradeoffs  between  tasks  will  need  to  address  this  issue  (Freldman  and 
Poison,  1981;  Herdman  and  Freidman,  1985;  Kinsbourne  and  Hicks,  1978). 

One  solution  for  these  models  would  be  to  incorporate  a  limited 
resource  attentional  filter  (Treisman,  1969).  In  the  case  of  dual  tasks, 
however,  the  resources  would  need  to  be  shared  not  only  among  different 
elements  within  a  task  but  also  between  tasks.  Additional  support  for  such 
a  filter  is  provided  by  the  task  level  x  attention  interaction.  The  decrese 
in  the  interlocation  attention  effect  in  the  dual  task  conditions  was 
primarily  due  to  a  reduced  amplitude  for  the  attended  field  Nl90s, 
suggesting  that  attention  was  withdrawn  from  the  processing  of  items  in  the 
cued  location  to  cope  with  the  demands  of  the  monitoring  tasks.  This 
pattern  of  results  suggests  that  the  attentional  mechanism  reflected  by  the 
N190  was  capable  of  strategically  allocating  resources  on  the  basis  of 
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processing  priorities  both  within  and  across  tasks. 

It  is  interesting  to  note  that  one  resource  model  in  particular  does 
address  the  issue  of  the  temporal  locus  of  resource  tradeoffs  between  tasks. 
Wickens  (1980)  proposed  a  multiple  resource  model  according  to  which 
attentional  resources  may  be  represented  by  three  dimensions:  stages  of 
processing,  codes  of  processing  and  modalities  of  processing.  The  stage 
dimension  is  further  subdivided  into  perceptual/central  and  response 
resources.  Thus,  two  tasks  may  compete  for  attentional  resources  at  either 
of  these  stages.  P300s  have  previously  been  used  to  indicate  a  demand  for 
perceptual/central  resources  (Isreal  et  al.,  1980b).  Manipulation  of 
response  related  demands  have  only  a  minimal  effect  on  the  amplitude  and 
latency  of  the  P300  (Isreal  et  al.,  1980a,  Magliero,  Bashore,  Coles  and 
Donchin,  1984;  McCarthy  and  Donchin,  1981;  Ragot,  1984).  The  sensitivity  of 
the  N190  to  intertask  demands  suggests  that  tradeoffs  between  tasks  may 
occur  substantially  earlier  than  P300  and  therefore  may  indicate  the  need 
for  a  further  subdivision  of  the  stage  dimension  (eg.  perceptual,  central 
and  response  demands).  Thus,  it  appears  that  the  N190  may  index  the 
allocation  of  perceptual  resources  while  the  P300  may  provide  a  metric  of  a 
combination  of  perceptual  and  central  processing  resources. 

Previous  studies  that  have  investigated  the  sensitivity  of  P300  to 
attentional  tradeoffs  between  tasks  have  focused  primarily  on  the  P300s 
elicited  by  target  stimuli  (Kramer  et  al.,  1983;  Natani  and  Comer,  1981; 
Lindholm  et  al.,  1984;  Sirevaag  et  al.,  1984).  In  the  present  study  we  also 
examined  the  effects  of  dual  task  demands  on  the  intra-  and  interlocation 
selective  processes  reflected  by  P300.  Consistent  with  previous  findings,  a 
main  effect  for  task  level  was  obtained.  The  amplitude  of  the  P300s 
elicited  in  the  peripheral  cask  decreased  with  the  imposition  of  the 
monitoring  task  as  well  as  with  an  increase  in  its  difficulty  from  the 
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correlated  to  the  uncorrelated  system.  However,  the  manner  in  which  P300s 
elicited  by  events  in  the  attended  and  ignored  locations  were  affected 
differed  as  a  function  of  the  level  of  dual  task  demands.  The  imposition  of 
the  monitoring  task  resulted  in  a  decrease  in  the  amplitude  of  P300s 
elicited  by  both  attended  and  unattended  field  stimuli  while  only  the  P300s 
elicited  by  events  in  the  attended  field  were  influenced  by  increased  dual 
task  demands.  These  results  may  indicate  that  the  pace  of  the  peripheral 
task  may  not  have  been  sufficient  (ISI  ■  1.2  to  1.5  sec)  to  require  tightly 
focused  attention  in  the  single  task  condition.  However,  with  the 
Imposition  of  the  foveal  task  it  was  necessary  for  subjects  to  greatly 
reduce  the  attention  they  allocated  to  the  unattended  field  stimuli. 

It  is  interesting  to  note  that  although  this  pattern  of  results  would 
imply  that  subjects  were  capable  of  strategically  altering  their 
distribution  of  attention  to  accomodate  increased  processing  demands,  the 
lack  of  a  significant  stimulus  type  x  task  level  x  attention  interaction 
suggests  that  their  flexibility  may  have  been  limited  to  interlocation 
selective  processes.  Since  P300s  are  larger  not  only  for  attended  field 
items  but  also  for  target  items  within  an  attended  location,  as  they  were  in 
the  single  task  condition  in  the  present  experiment,  it  might  be  expected 
that  subjects  would  reallocate  resources  that  had  been  used  to  process 
nontarget  items  in  the  attended  field  prior  to  reducing  their  attention  to 
target  items.  Within  Broadbentian  (1982)  nomenclature,  it  appears  that 
although  subjects  were  able  to  filter  successfully  on  the  basis  of  stimulus 
set  (eg.  location)  to  cope  with  external  task  demands,  they  did  not  limit 
their  attention  to  specific  items  within  an  attended  location.  Future 
research  will  be  necessary  to  determine  if  such  hierarchical  filtering  is  a 
viable  strategy  in  other  multi-task  situations. 

The  Zoom  Lens  model  has  been  proposed  to  account  for  differences  in  the 
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speed  and  precision  of  processing  during  focused  and  divided  attention  in 
the  visual  modality  (Eriksen  and  Saint  James,  1986).  It  has  been  argued 
that  a  fixed  quantity  of  attentional  resources  is  available  for  the 
processing  of  items  in  the  visual  field  and  chat  the  performance  differences 
observed  in  focused  and  divided  attention  can  be  attributed  to  the  density 
of  resources  allocated  to  task  relevant  events.  Although  the  model  was 
specifically  designed  to  account  for  the  distribution  of  attentional 
resources  within  a  task,  the  results  of  the  present  study  suggest  that  it 
might  apply  equally  well  to  divided  attention  between  tasks.  The  N160,  N 190 
and  P300  discriminated  between  cued  and  uncued  visual  fields  during  focused 
attention.  The  requirement  to  divide  attention  between  two  different  tasks 
reduced  the  amplitude  of  the  N190s  and  P300s  elicited  by  items  in  the 
attended  field.  Performance  also  decreased  from  the  focused  to  the  divided 
attention  conditions.  This  pattern  of  results  is  consistent  with  the 
prediction  that  a  reduction  in  the  magnitude  of  resources  allocated  to  the 
processing  of  visual  events,  as  Indicated  by  the  amplitude  of  the  N19Q  and 
P300,  results  in  a  decrement  in  performance.  The  present  results  suggest 
that  this  prediction  is  fulfilled  when  attention  is  distributed  between  as 
well  as  within  tasks. 

Previous  studies  have  found  that  NIOOs  elicited  by  stimuli  lateral  to 
the  attended  location  progressively  decrease  in  amplitude  with  increasing 
distance  (Mangun  and  Hillyard,  1987).  These  ERP  results  are  consistent  with 
other  studies  that  have  found  increased  RTs  and  decreased  accuracies  as 
unexpected  stimuli  that  require  a  response  are  presented  at  increasing 
distances  from  an  attended  location  (Downing  and  Pinker,  1985;  Eriksen  and 
Saint  James,  1986;  LaBerge  and  Brown,  1986).  Taken  together,  the  behavioral 
and  ERP  studies  suggest  that  the  spatial  filter  functions  with  a  gradual 
dropoff  of  processing  resources  with  increasing  distance  from  the  focus  of 
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attention.  The  results  of  the  present  study  are  consistent  with  this 
conception  of  the  visual-spatial  filter.  The  differences  in  the  N160,  N190 
and  P300's  elicited  by  stimuli  in  the  attended  and  unattended  fields  were 
magnified  with  increases  in  the  spatial  separation  of  the  stimuli  from  2.8 
to  18.2  degrees. 

There  has  been  some  controversy  as  to  the  effects  of  foveal  task 
difficulty  on  the  processing  of  information  in  the  visual  periphery. 
Proponents  of  the  Cognitive  Tunnel  Vision  model  assert  that  the  functional 
visual  field  constricts  with  increasing  foveal  task  demands  (Ikeda  and 
Takcuchi,  1975;  Mackvorth,  1965;  Williams,  1985).  In  terms  of  a  joint 
ERP/performance  analysis  this  would  suggest  that  (a)  with  increases  in  the 
difficulty  of  the  foveal  task,  performance  in  the  peripheral  should  decrease 
in  proportion  to  the  distance  of  the  stimuli  from  the  fovea  and,  (b)  the 
N160,  N190  and  P300  attentional  gradient  effects  should  interact  with  the 
difficulty  of  the  foveal  task  such  that  the  difference  between  the 
components  elicited  by  the  attended  and  unattended  field  stimuli  should 
dropoff  more  quickly  at  the  distant  locations.  Neither  of  these  effects 
were  obtained  in  the  present  experiment.  Thus,  our  results  provide  support 
for  the  counter  arguement  that  asserts  chat  stimuli  in  the  visual  periphery 
suffer  from  the  division  of  attention  between  a  foveal  and  peripheral  task, 
irrespective  of  their  physical  distance  from  the  fovea.  There  were, 
however,  several  methodological  differences  between  the  present  experiment 
and  those  that  have  obtained  support  for  the  Cognitive  Tunnel  Vision  model. 
These  differences  included  the  use  of  a  continuous  rather  than  a  discrete 
foveal  task  and  a  substantially  longer  presentation  of  peripheral  stimuli 
(50  versus  10  msec).  Although,  given  the  differences  in  methodologies  it  is 
impossible  for  us  assert  that  Cognitive  Tunnel  vision  does  not  occur,  the 
necessity  for  either  of  these  conditions  would  appear  to  greatly  limit  the 
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generalizability  of  the  phenomenon 
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Figure  1  A  graphic  representation  of  the  triangle  monitoring  and  arrow 
discrimination  tasks.  The  different  versions  of  the  triangle  monitoring 
task  are  also  indicated  for  both  normal  and  failure  states. 

Figure  2_  Grand  average  waveforms  elicited  by  the  standards  in  each  of  the 
experimental  conditions.  The  waveforms  were  recorded  at  the  Fz  site. 

Figure  _3  Grand  average  waveforms  elicited  by  the  standards  in  each  of  the 
experimental  conditions.  The  waveforms  were  recorded  at  the  C z  site. 
Figure  4_  Grand  average  waveforms  elicited  by  the  standards  in  each  of  the 
experimental  conditions.  The  waveforms  were  recorded  at  the  Pz  site. 
Figure  _5  Grand  average  waveforms  elicited  by  the  targets  in  each  of  the 
experimental  conditions.  The  waveforms  were  recorded  at  the  Pz  site. 
Figure  h_  Grand  average  waveforms  elicited  by  the  standards  in  each  of  the 
experimental  conditions.  The  waveforms  were  recorded  at  the  ipsilateral 
sites. 

Figure  7  Grand  average  waveforms  elicited  by  the  standards  in  each  of  the 
experimental  conditions.  The  waveforms  were  recorded  at  the  Oz  site. 
Figure  8^  Grand  average  waveforms  elicited  by  the  standards  in  each  of  the 
experimental  conditions.  The  waveforms  were  recorded  at  the  contralateral 
sites. 

Figure  9^  Grand  average  target  waveforms  for  each  of  the  midline  sites  in 
the  single  and  dual  task  conditions. 

Figure  10  Grand  average  target  waveforms  for  each  of  the  attended  field 
stimuli  for  each  of  the  task  levels  and  retinal  eccentricities. 
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Footnotes 

1.  Since  the  single  and  dual  task  monitoring  conditions  were  not  directly 
comparable  due  to  the  levels  of  retinal  eccentricity  of  arrow  presentation 
in  the  dual  task  conditions,  three  separate  ANOVA's  were  performed.  Single 
task  monitoring  performance  was  compared  to  monitoring  performance  when 
paired  with  close,  middle  and  far  arrow  presentations.  Single  and  dual  task 
measures  did  not  differ  in  any  of  these  comparisons. 

2.  All  post-hoc  comparisons  reported  in  the  article  are  computed  with  the 
Bonferroni  t-test  and  are  significant  at  p<.05. 
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Table  1 


Average  RT  (msec),  number  of  hits,  misses  and  FA's  for  the  triangle 
monitoring  task  In  the  dual  task  conditions  for  each  of  the  three 
retinal  eccentricities  of  the  arrow  stimuli. 


Measures 

Correlated 

Condition 

Uncorrelated  Condition 

Close 

Middle 

Far 

Close 

Middle 

Far 

Average 

RT 

1720 

1634 

1837 

4356 

4194 

4339 

Average 

Hits 

49 

49 

50 

41 

38 

40 

Average 

Misses  .6 

.4 

.5 

3.8 

3.6 

3.0 

Average 

FA's 

1.1 

1.6 

1.6 

12.7 

13.3 

12.5 
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Table  2 

Average  RT  and  P300  latency  (in  parentheses)  in  msec  obtained  at  the 
lateral  sites  for  the  arrow  discrimination  task  in  both 
the  single  and  dual  task  conditions  for  each  of  the 
retinal  eccentricities. 

Retinal  Eccentricities 


Single  Task 
Dual  Correlated 


Close 
399  (386) 
476  (455) 


Middle 
419  (435) 
478  (463) 


Far 

445  (461) 
506  (476) 
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Dual  Uncorrelated 


494  (460) 


492  (471) 


506  (511) 
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Table  3 


Grand  average  N160  amplitude  measures  in  microvolts  for  Cz 
and  Pz  electrode  sites  and  N190  measures  at  occipital 
recording  sites  for  the  attended  and  ignored 
stimuli  at  the  three  retinal  eccentricities. 


Retinal  Eccentricity 


Electrode 

Close 

Middle 

Far 

Site 

Attend 

Ignore 

Attend 

Ignore 

Attend 

Ignore 

Cz 

1.9 

1.7 

2.3 

1.8 

2.6 

1.5 

Pz 

2.0 

1.7 

2.4 

1.9 

2.8 

1.6 

Ip6ilateral 

2.1 

2.0 

2.4 

2.5 

2.3 

2.2 

Oz 

2.3 

2.1 

2.9 

1.9 

3.1 

1.7 

Contralateral 

2.8 

2.2 

3.2 

1.9 

3.6 

1.7 
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Table  4 


Grand  average  N190  amplitude  measures  in  microvolts  for 
Oz,  ipislateral  and  contralateral  sites  for  the  three 
task  levels  and  the  attended  and  ignored  stimuli 


Task 

Ipsilateral 

Oz 

Contralateral 

Level 

Attend 

Ignore 

At  t  end 

Ignore 

Attend 

Ignore 

Single 

2.7 

2.6 

3.0 

1.9 

3.8 

1.9 

Dual-Correlated 

1.7 

1.8 

1.9 

1.7 

2.4 

1.7 

Dual-Independ ent 

1.8 

1.8 

2.0 

1.7 

2.5 

1.6 
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Table  5 

Grand  average  P300  amplitude  measures  in  microvolts  at  the 
midline  and  combined  ipsi/contralateral  sites  for  stimuli  in 
the  attended  and  nonattended  fields. 


Electrode 

Targ 

Attend 

eta 

Ignore 

Standards 
Attend  It 

Fz 

7.0 

3.1 

4.0 

Cz 

11.9 

3.9 

5.1 

Pz 

16.7 

4.1 

5.0 

Oz 

9.5 

3.8 

3.9 

Ipsi/ contra 

9. A 

2.9 

2.9 
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Table  6 

Grand  average  P300  amplitude  measures  in  microvolts  for 
the  attended  and  ignored  field  stimuli  at  each  of 
the  retinal  eccentricities.  The  amplitude  values  are 
collapsed  over  targets  and  standards. 


Retinal 

Eccentricity  Attend  Ignore 

Close  7.4  3.5 


Middle 


7.8 


2.7 
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Table  7 

Grand  average  P300  amplitude  measures  in  microvolts  for 
the  attended  and  ignored  field  stimuli  for  each  of 
the  task  levels.  The  amplitude  values  are  collapsed 
over  targets  and  standards. 


Task 

Level 

Attend 

Ignore 

Single 

11.6 

3.5 

Dual-Correlated 

9.2 

2.1 

Dual-Independent 


8.0 


2.1 


msec 


Ipsilateral  Standards 


msec 


msec 


Contralateral  Standards 


msec 
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Abstract 


Previous  research  has  found  that  properties  of  automatic  processing  do 
not  always  co-occur,  suggesting  that  the  acquisition  rates  may  differ.  The 
present  study  investigated  the  acquisition  rate  of  several  of  these 
properties  by  employing  additive  factors  logic,  dual  task  methodology,  and 
event-related  brain  potentials.  Seven  subjects  participated  in  a  ten 
session  experiment  in  which  they  performed  two  tasks,  a  visual/memory  search 
task  and  a  pursuit  step  tracking  task,  both  together  and  separately.  RT  and 
P300  latency  measures  indicated  that  parallel  processing  of  the  display  was 
achieved  early  in  training  in  the  consistent  mapping  condition.  This 
processing  was  unaffected  by  dual  task  demands.  An  analysis  of  RT/P300 
ratios  suggests  that  another  form  of  perceptual  efficiency  was  achieved 
later  in  practice  in  both  the  varied  and  consistently  mapped  search  tasks. 
This  effect  was  larger  in  the  consistent  mapping  condition.  Reductions  in 
the  slope  of  the  memory  set  function  occurred  significantly  earlier  for  P300 
latency  than  for  RT ,  suggesting  that  the  stimulus  evaluation  processes 
became  automated  more  rapidly  than  the  response  selection  components  of 
memory  search.  Consistent  with  an  analysis  of  the  processing  demands  of  the 
two  tasks,  the  introduction  of  the  tracking  task  and  an  increase  in  tracking 
difficulty  produced  equivalent  interference  during  consistent  and  varied 
mapping  conditions.  Results  are  discussed  in  terms  of  models  skill 
acquisition  and  component  task  automaticity. 
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Assessing  the  Development  of  Automatic  Processing:  An  Application  of 
Dual-Task  and  Event-Related  Brain  Potential  Methodologies 

In  recent  years  a  number  of  theories  have  been  proposed  to  account  for 
the  quantitative  and  qualitative  changes  in  performance  and  subject 
strategies  that  occur  during  the  development  of  highly  skilled  behaviors 
(LaBerge,  1981;  Logan,  in  press;  Neumann,  1984;  Posner  and  Snyder,  1975; 
Schneider  and  Shiffrin,  1977).  It  is  interesting  to  note  that  many  of  the 
characteristics  of  these  "automatic"  processes  were  described  over  80  years 
ago.  Solomons  and  Stein  (1896),  in  their  introspective  studies  of  reading 
and  writing,  described  the  development  of  automatic  processing  in  terms  of  a 
rapid  increase  in  the  speed  of  performance,  a  reduction  of  the  effort  and 
attention  required  to  perform  the  task,  a  lack  of  memory  for  automatically 
processed  events,  and  poor  development  of  automatic  processing  in  situations 
in  which  difficult  perceptual  discriminations  were  necessary.  Bryan  and 
Harter's  (1897,  1899)  studies  of  professional  telegraphers  allowed  them  to 
add  several  other  characteristics  to  this  list.  They  described  the  highly 
skilled  telegraphers'  performance  as  being  resistant  to  intrusions  and 
highly  stereotypical.  Although  a  few  additional  characteristics  have  been 
added  to  the  list  in  recent  years  (e.g.,  difficult  to  modify,  activation 
without  intention;  Shiffrin  and  Dumais,  1981),  the  observations  made  at  the 
turn  of  the  century  still  provide  the  core  set'  of  attributes  that  typify 
automatic  processing. 

One  of  the  goals  of  the  present  study  is  to  provide  a  fine  grained 
analysis  of  the  changes  in  information  processing  that  occur  during  the 
development  of  highly  skilled  behavior.  To  this  end  we  have  employed 
several  methodological  approaches,  including  dual-task  techniques,  additive 
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factors  logic  and  event-related  brain  potentials  (ERP),  in  the  assessment  of 
the  changes  in  subject's  performance  and  processing  strategies  over  an 
extended  period  of  training.  A  second  issue  that  is  addressed  in  the 
present  study  is  the  degree  to  which  automatic  processing  can  be  localized 
to  specific  information  processing  components  within  highly  practiced  tasks. 
Dev elopment  of  Automatic  Processing 

A  paradigm  that  has  been  extensively  employed  in  the  examination  of 
automaticity  is  a  modification  of  the  Sternberg  memory  search  task  (1966, 
1969).  In  this  paradigm  subjects  are  instructed  to  memorize  a  set  of  items. 
After  memorization  has  been  completed,  subjects  compare  a  series  of  visually 
presented  probes  to  the  members  of  the  memory  set.  Subjects  make  one 
response  if  one  of  the  visually  presented  items  is  from  the  memory  set  and 
another  response  if  the  probe(s)  do  not  match  any  of  the  memory  set  items. 
Probes  that  match  an  item  in  the  memory  set  are  referred  to  as  targets  while 
probes  that  do  not  match  a  memory  set  item  are  labeled  distractors.  In  an 
extensive  series  of  studies,  Schneider  and  Shiffrin  (1977;  see  also  Shiffrin 
and  Schneider,  1977)  have  demonstrated  that  automatic  processing  develops  in 
a  consistent  mapping  condition  (CM)  in  which  targets  are  always  selected 
from  one  set  of  items  (e.g.,  letters  A  to  M)  and  distractors  are  selected 
from  another  set  of  items  (e.g.,  letters  N  to  Z).  Thus,  the  mapping  of  the 
stimuli  to  the  responses  does  not  vary  over  trials  in  the  CM  conditions. 
Non-automatic  or  controlled  processing  is  employed  in  a  varied  mapping  (VM) 
condition  in  which  subjects  are  unable  to  consistently  map  stimuli  to 
responses.  In  the  VM  conditions  both  targets  and  distractors  are  chosen 
from  the  same  set  of  items  (e.g.,  letters  A  to  Z).  Thus,  targets  and 
distractors  exchange  roles  over  trials  in  the  VM  conditions. 

As  a  result  of  their  studies,  Schneider  and  Shiffrin  (1977)  proposed  a 
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two-process  theory  to  account  for  changes  in  performance  with  practice. 
Automatic  processing  which  developed  as  a  result  of  extensive  practice  with 
consistent  stimulus-response  relations  was  characterized  as  fast, 
inflexible,  difficult  to  suppress  once  learned,  and  not  limited  by 
short-term  memory  capacity  or  attention.  It  was  suggested  that  controlled 
processing  occurs  in  novel  situations  or  in  situations  in  which 
stimulus-response  relations  are  inconsistent  over  time.  Controlled 
processing  was  characterized  as  slow,  serial  and  capacity  limited. 

Asymptotic  controlled  processing  requires  little  practice  and  is  easily 
modified.  The  qualitative  and  quantitative  differences  between  automatic 
and  controlled  processing  modes  have  been  demonstrated  in  search  tasks  with 
a  variety  of  stimulus  materials  including;  characters  (Shiffrin  and 
Schneider,  1977),  words  (Fisk  and  Schneider,  1983),  categories  (Fisk  and 
Schneider,  1984a),  spatial  and  temporal  patterns  (Eberts  and  Schneider, 

1986;  Myers  and  Fisk,  1987)  and  higher-order  rules  (Fisk  and  Oransky,  in 
press;  Kramer  and  Strayer,  1987a). 

A  number  of  criteria  have  been  employed  in  the  memory/visual  search 
task  to  distinguish  between  automatic  and  controlled  processing  modes.  One 
of  these  is  referred  to  as  the  "zero  slope"  criterion.  This  criterion  is 
satisfied  when  the  memory  or  visual  search  slope  is  reduced  to  less  than  10 
msec  per  item  in  a  character  classification  task  (Schneider,  1985).  This 
can  be  compared  with  a  40  msec  slope  in  an  equally  practiced  VM  condition. 
Two  other  criteria  were  developed  within  the  context  of  dual  task 
experiments  in  which  a  CM  task  is  paired  with  a  VM  task.  One  of  the 
criteria  deals  with  performance  in  the  CM  task.  The  "perfect  time  sharing" 
criterion  is  fulfilled  when  the  performance  in  the  dual  task  condition  is 
equivalent  to  performance  in  the  single  task  CM  condition.  Thus,  in  this 
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case  CM  performance  is  insensitive  to  the  imposition  of  the  VM  task  or  an 
increase  in  its  difficulty.  The  second  of  the  dual  task  criteria  deals  with 
performance  in  the  VM  task.  The  "intrusion"  criterion  is  satisfied  when 
performance  in  the  VM  task  is  reduced  with  the  occurrence  of  a  CM  target, 
despite  subjects'  attempts  to  ignore  the  CM  event.  The  co-occurrence  of 
these  criteria  has  been  used  to  evaluate  the  internal  consistency  of 
automatic  processing.  However,  several  investigators  have  noted  that  these 
criteria  do  not  always  co-occur  (K.ahneraan  and  Chajzyck,  1983;  Paap  and 
Ogden,  1981;  Regan,  1981).  This  led  Logan  (1985)  to  suggest  that  automatic 
processing  might  be  better  characterized  by  a  continuum  than  a  dichotomy 
since  the  properties  of  automatic  processing  appear  to  develop  at  different 
rates  during  training. 

Schneider's  (1985)  recent  extension  of  the  two-process  theory 
(Schneider  and  Shiffrin,  1977;  Shiffrin  and  Schneider,  1977)  provides  a 
description  of  the  changes  that  occur  in  performance  with  the  development  of 
automaticity.  According  to  Schneider,  the  transition  takes  place  in  four 
phases.  In  the  first  phase  controlled  processing  predominates.  This  phase 
is  characterized  by  the  effect  of  memory  load  on  performance  in  the  search 
task.  In  phase  two,  controlled  and  automatic  processing  co-occur  and 
performance  is  a  mixture  of  the  two  processing  modes.  This  phase  occurs 
shortly  after  the  introduction  of  consistent  practice  and  is  characterized 
by  a  flattening  of  the  memory  load  function  for  the  larger  set  sizes.  It  is 
proposed  that  the  reduction  in  slope  for  the  larger  set  sizes  is  due  to  the 
weak  automatic  processing  finishing  first  when  controlled  processing  is 
relatively  slow.  The  third  phase  is  characterized  by  a  lack  of  effect  of 
memory  load  on  performance  (i.e.  zero  slope).  The  controlled  sequential 
operations  are  no  longer  necessary  due  to  the  increased  strength  of  the 


Development  of  Automatic  Processing 


Page  5 


automatic  processing;  however,  subjects  still  allocate  attention  to  perform 
the  task.  The  fourth  phase  represents  pure  automatic  processing  and  is 
characterized  by  a  zero  slope  and  perfect  time  sharing.  In  the  present 
study  we  will  examine  the  transitions  in  processing  predicted  by  Schneider's 
(1985)  model.  This  will  be  accomplished  by  using  ERPs  in  conjunction  with 
performance  measures  to  localize  the  changes  in  information  processing  that 
take  place  during  the  development  of  automaticity.  For  example,  the  joint 
use  of  RT  and  P300  latency  will  allow  us  to  distinguish  between  stimulus 
evaluation  and  response  processing  interpretations  of  changes  in  the 
Sternberg  memory  set  slope  and  intercept  with  increased  practice  and  dual 
task  demands. 

Component  Task  Automaticity 

In  a  recent  review  of  the  automaticity  literature,  Jonides, 

Naveh-Benj amin  and  Palmer  (1985)  have  suggested  that,  "the  investigation  of 
the  task  as  a  whole  may  lead  one  to  conclude  that  the  task  is  not  performed 
automatically,  when  one  or  more  component  processes  used  in  completing  the 
task  may,  in  fact,  be  automatic  by  acceptable  criteria"  (p.  163).  Other 
investigators  have  also  emphasized  the  importance  of  a  careful  analysis  of 
task  structure  prior  to  the  assessment  of  automaticity.  For  example, 

LaBerge  (1975,  1981)  distinguishes  between  two  types  of  automatic  processing 
that  develop  under  different  conditions,  within  system  or  perceptual 
automaticity  and  between  system  or  association  automaticity.  Fisk  and 
Schneider  (1984b)  argue  that  automatic  processing  can  develop  for  consistent 
components  of  a  task  even  when  the  entire  task  is  not  consistent.  In  an 
Ingenious  series  of  studies  Logan  (1979)  used  additive  factors  logic  within 
a  dual  task  paradigm  to  examine  the  relative  automaticity  of  different 
component  processes  (see  also  Logan,  1978).  In  these  studies  VM  and  CM 
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choice  reaction  time  tasks  were  paired  with  a  concurrent  memory  retention 
task.  Initially,  performance  in  both  the  CM  and  VM  tasks  interacted  with 
increases  in  the  difficulty  of  the  concurrent  task.  However,  following 
practice  performance  in  the  VM  task  continued  to  interact  with  increases  in 
the  difficulty  of  the  concurrent  task  while  performance  in  the  CM  task  was 
additive  with  dual  task  demands.  This  reduced  dual  task  interference 
suggests  that  the  automatic  processing  did  not  place  demands  on  the  same 
limited  capacity  processes  required  by  the  concurrent  memory  retention  task. 

The  interpretation  of  the  dual  task  results  in  terms  of  overlapping 
demands  is  consistent  with  resource  models  of  attention  which  assume  that 
processing  resources  are  limited  in  quantity  and  shareable  between 
concurrently  performed  tasks  (Kahneman,  1973;  Norman  and  Bobrow,  1975; 
Kantowitz  and  Knight,  1976).  Current  formulations  of  resource  theory  hold 
that  a  number  of  processing  units  have  their  own  supply  of  resources  that 
can  be  shared  by  several  ongoing  cognitive  operations  (Freidman  and  Poison, 
1981;  Kinsbourne  and  Hicks,  1978;  Navon  and  Gopher,  1979;  Sanders,  1983). 
One  such  model  proposed  by  Wickens  (1980)  argues  that  processing  resources 
may  be  represented  by  three  dimensions:  stages  of  processing 
( perceptual/central  and  response),  codes  of  processing  (verbal  and  spatial) 
and  modalities  (visual  and  auditory).  Tasks  that  place  demands  on  the  same 
limited  capacity  processes  are  predicted  to  be  more  poorly  timeshared  than 
tasks  that  do  not  overlap  in  their  processing  requirements. 

The  multiple  resource  interpretation  of  the  pattern  of  dual  task 
interactions  is  consistent  with  the  observation  that  highly  practiced  CM 
tasks  can  be  timeshared  with  less  interference  from  some  types  of  tasks 
(Hirst,  Spelke,  Channick  and  Neisser,  1980;  Schneider  and  Fisk,  1982; 
Solomons  and  Stein,  1896)  than  others  (Hoffman,  Nelson,  and  Houck,  1983; 
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Hoffman,  Houck,  MacMillian,  Simons,  and  Oatman,  1985).  In  fact,  Hoffman  et 
al.  (1985)  have  suggested  that  CM  memory/visual  search  tasks  may  still  place 
demands  on  two  separate  limited  capacity  processes:  one  concerned  with  the 
episodic  representation  of  the  stimulus  array  and  the  other  with  the 
production  of  speeded  responses.  In  the  present  experiment  we  will  further 
examine  the  issue  of  component  task  automaticity  by  pairing  a  pursuit  step 
tracking  task  which  has  previously  been  shown  to  place  heavy  demands  on 
perceptual  and  motor  (Vidulich  and  Wickens,  1981;  Wickens,  Derrick, 

Micallizi  and  Beringer,  1980)  processes  with  the  Sternberg  memory  search 
task.  It  is  predicted  that  the  imposition  of  the  tracking  task  and  an 
increase  in  its  difficulty  will  interact  with  the  intercept  of  the  memory 
set  function  in  the  Sternberg  task  but  will  be  additive  with  the  slope. 

This  prediction  is  based  upon  additive  factors  logic  (Sternberg,  1969), 
which  suggests  that  the  intercept  in  the  memory  comparison  task  reflects 
both  encoding  and  responses  processes  while  the  slope  provides  an  index  of 
the  time  required  to  complete  memory  comparison  processes.  Furthermore,  we 
predict  that  the  magnitude  of  the  interaction  between  the  tracking  task  and 
the  Sternberg  task  will  be  the  same  for  VM  and  CM  conditions  since  it  is  the 
memory  comparison  process  that  becomes  automated  in  the  memory  search  task. 
Event-Related  Brain  Potentials 

The  ERP  is  a  transient  series  of  voltage  oscillations  in  the  brain  that 
can  be  recorded  in  response  to  the  occurrence  of  a  discrete  event  (Donchin, 
1981;  Regan,  1972).  The  ERP  can  be  partitioned  into  a  number  of  separate 
components  identified  by  the  polarity  and  approximate  latency  of  the  peak 
(Donchin,  Ritter  and  McCallum,  1978).  Two  separate  components,  the  N200  and 
P300,  will  be  examined  in  the  present  study  in  an  effort  to  aid  in  the 
explication  of  the  processing  changes  that  take  place  during  the  development 
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of  automatic  processing  in  single  and  dual  task  conditions. 

The  P300  component  of  the  ERP  is  represented  by  a  positive  voltage 
deflection  maximal  over  the  parietal  scalp  with  a  minimal  peak  latency  of 
300  msec  post-stimulus  (Sutton,  Braren,  Zubin  and  John,  1965).  This  ERP 
component  is  particularly  useful  in  the  study  of  automatic  processing  since 
its  latency  appears  to  be  influenced  by  stimulus  evaluation  processes  while 
being  relatively  unaffected  by  the  processes  of  response  selection  and 
execution  (Magliero,  Bashore,  Coles  and  Donchin,  1984;  McCarthy  and  Donchin, 
1981;  Ragot,  1984).  Thus,  the  use  of  P300  latency  in  conjunction  with  RT 
will  allow  us  to  isolate  processing  changes  to  pre  and  post  response 
processes . 

A  number  of  studies  have  jointly  examined  reaction  time  and  P300 
latency  in  a  VM  Sternberg  task  (e.g.,Adam  and  Collins,  1978;  Brookhuis, 
Mulder,  Mulder,  and  Gloerich,  1983;  Ford,  Pfefferbaum,  Tinklenberg,  and 
Kopell,  1982;  Ford,  Roth,  Mohs,  Hopkins,  and  Kopell,  1979;  Gomer,  Spicuzza, 
and  O'Donnel,  1976;  Pfefferbaum,  Ford,  Roth,  &  Kopell,  1980;  Strayer, 
Wickens,  and  Braune,  1987).  The  consensus  of  these  studies  is  that  both 
reaction  time  and  P300  latency  increase  with  memory  load;  however,  the 
effects  were  greater  for  reaction  time.  This  suggests  that  the  reaction 
time  slope  contains  both  a  stimulus  evaluation  component  and  a 
response-related  component  (cf.  Marcel,  1976). 

P300  amplitude  has  been  found  useful  in  the  study  of  the  allocation  of 
processing  resources  among  concurrently  performed  tasks.  Within  a  dual  task 
paradigm,  P300  has  been  found  to  increase  in  amplitude  with  increased 
processing  demands  when  elicited  by  task  relevant  events  in  a  primary  task. 
On  the  other  hand,  P300s  elicited  by  secondary  task  events  decrease  in 
amplitude  with  increases  in  the  perceptual/cognitive  difficulty  of  a  primary 
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task  (Isreal,  Chesney,  Wickens  and  Donchin,  1980;  Isreal,  Wickens,  Chesney, 
and  Donchin,  1980;  Kramer,  Sirevaag  and  Braune,  1987;  Kramer,  Wickens  and 
Donchin,  1983,  1985;  Sirevaag,  Kramer,  Coles  and  Donchin,  1984;  Wickens, 
Kramer,  Vanasse  and  Donchin,  1983).  This  pattern  of  changes  in  P300 
amplitude  is  consistent  with  predictions  of  resource  models  of  attentional 
allocation  (Navon  and  Gopher,  1979).  Thus,  it  appears  that  while  P300 
latency  provides  information  concerning  the  mental  chronometry  of 
information  processing,  P300  amplitude  is  sensitive  to  changes  in  the 
resource  demands  of  processes. 

The  N200  component  of  the  ERP  is  sensitive  to  the  degree  of  mismatch 
between  stimulus  events  (Naatanen,  Simpson  and  Loveless,  1982;  Naatanen  and 
Gaillard,  1983;  Ritter  et  al.,  1984).  In  addition  to  its  sensitivity  to 
physical  mismatch,  N200s  have  also  been  found  to  be  elicited  by 
orthographic,  phonological  and  semantic  mismatches  (Kramer  and  Donchin, 

1987;  Sandquist,  Rohrbaugh,  Syndulko  and  Lindsley,  1980).  Naatanen  and 
Picton  (1986)  have  suggested  that  the  N200  component  is  composed  of  a  number 
of  temporally  overlapping  subcomponents,  some  of  which  are  elicited  only 
during  controlled  processing  (e.g.  N2b),  and  others  which  are  evoked  during 
automatic  processing  (e.g.  mismatch  negativity).  In  the  present  study  the 
N200  will  be  used  to  provide  an  index  of  the  changes  in  mismatch  processing 
as  a  function  of  the  level  of  automatic  processing  and  dual  task  demands. 

A  number  of  recent  studies  of  automatic  processing  have  used  ERPs  as 
dependent  measures.  Hoffman,  Simmons,  and  Houck  (.1983)  and  van  Dellen, 
Brookhuis,  Mulder,  Okita,  and  Mulder  (1985)  employed  ERPs  to  assess  changes 
in  information  processing,  both  in  terms  of  the  timing  of  information 
processes  as  well  as  their  resource  demands,  during  automatic  detection. 
Subjects  searched  visual  arrays  for  predefined  targets.  Several  interesting 
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results  were  obtained.  Both  research  groups  found  that  in  the  VM  visual 
search  condition  RT  and  P300  latency  increased  with  larger  visual  arrays. 
However,  the  RT  and  P300  latency  display  size  effects  were  significantly 
smaller  in  the  CM  than  the  VM  conditions.  These  results  suggest  that  both 
stimulus  evaluation  and  response  processes  are  reduced  during  automatic 
detection.  Both  groups  of  investigators  also  found  that  P300  amplitudes  were 
not  significantly  different  in  the  CM  and  VM  conditions.  Within  a  resource 
model  framework  these  results  argue  that  automatic  processing  does  appear  to 
consume  at  least  one  type  of  perceptual/cognitive  processing  resource. 
Finally,  van  Dellen  et  al.  (1985)  found  that  the  difference  between  target 
and  nontarget  ERPs  occurred  much  earlier  in  the  CM  than  VM  condition.  These 
results  were  interpreted  in  terms  of  more  efficient  perceptual  coding  in  the 
CM  task.  Kramer,  Schneider,  Fisk  and  Donchin  (1986)  obtained  essentially 
the  same  P300  results  in  a  memory  search  paradigm.  Neither  P300  latency 
nor  RT  was  influenced  by  memory  set  size  in  the  practiced  CM  conditions. 

P300  amplitude  did  not  differ  in  the  CM  and  VM  tasks. 

Hoffman  et  al.  (1985)  employed  ERPs,  and  more  specifically  the  P300 ,  to 
provide  converging  evidence  for  the  role  of  attentional  resources  in  the 
automatic  detection  task.  A  Sternberg  memory  search  task  was  paired  with  a 
dot  detection  task.  Subjects  were  required  to  make  a  detection  response  for 
each  task  on  every  trial.  Processing  priority  was  manipulated  by 
instructing  subjects  to  emphasize  one  or  the  other  task  or  treat  them  both 
equally.  Performance  measures  and  P300  amplitude  showed  a  tradeoff  as  a 
function  of  processing  priority,  suggesting  that  with  this  particular  task 


combination  processing  resources  were  employed  during  automatic  processing. 

The  present  study  was  designed  to  extend  these  results  in  several  ways. 
First,  we  have  recorded  ERPs  both  during  and  after  extensive  practice  in  CM 
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and  VM  memory  search  tasks  in  order  to  track  the  changes  in  information 
processing  that  accompany  the  development  of  automatic  processing.  Second, 
in  addition  to  the  manipulation  of  memory  set  size,  we  have  employed  a 
display  size  of  two  items  arranged  horizontally.  This  will  allow  us  to 
examine  the  development  of  automatic  processing  in  both  visual  and  memory 
search  components  of  the  task.  Third,  we  have  employed  a  concurrent  task, 
which  on  the  basis  of  previous  research,  is  predicted  to  interact 
selectively  with  perceptual  and  response  but  not  memory  comparison 
components  of  the  search  task.  This  will  enable  us  to  assess  the  effects  of 
component  specific  dual  task  demands  on  the  development  of  automatic 
processing  in  the  search  task. 

Method 


Subj ects 

Seven  right  handed  persons  (4  male  and  3  female),  aged  22  to  27  years, 
were  recruited  from  the  student  population  at  the  University  of  Illinois  and 
paid  for  their  participation  in  the  study.  None  of  the  students  had  any 
prior  experience  with  either  of  the  experimental  tasks.  All  of  the  subjects 
had  normal  or  corrected  to  normal  vision.  Each  of  the  subjects  participated 
in  the  ten  experimental  sessions. 


Insert  Figure  1  about  here 


Step  Tracking  and  Sternberg  Tasks 

The  single  axis  pursuit  step  tracking  task  is  illustrated,  along  with 
the  Sternberg  probes,  in  Figure  l.  The  tracking  display  which  consisted  of 
the  computer  driven  target  and  the  subject  controlled  cursor  was  presented 
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on  a  Hewlett  Packard  CRT  which  was  positioned  approximately  70  cm  from  the 
subjects.  The  rectangular  target  was  1.5  cm  x  1.1  cm  in  size  and  subtended  a 
visual  angle  of  1.2  degrees  horizontally  and  .9  degrees  vertically.  The 
cursor  consisted  of  one  vertical  and  two  horizontal  .8  cm  lines,  and 
subtended  a  visual  angle  of  2.4  degrees  horizontally  and  .9  degrees 
vertically.  The  target  changed  its  position  along  the  horizontal  axis  once 
every  3.0  sec  and  the  subjects'  task  was  to  nullify  the  position  error 
between  the  target  and  cursor.  The  target  could  jump  anywhere  along  the 
horizontal  axis.  The  magnitude  and  direction  of  the  jump  were  randomly 
determined  on  each  trial.  The  cursor  was  controlled  by  manipulating  a 
joystick  with  the  right  hand.  Single  task  tracking  blocks  were  comprised  of 
100  step  changes  and  lasted  approximately  five  min.  Although  changes  in  the 
spatial  position  of  the  target  were  discrete  events,  the  tracking  task  was 
performed  continuously  since  the  subjects  were  required  to  constantly 
manipulate  the  joystick  to  nullify  the  position  error  between  the  target  and 
cursor.  The  high  gain  of  the  tracking  system  necessitated  constant  movement 
of  the  joystick  to  control  the  position  of  the  cursor. 

The  dynamics  for  the  tracking  stick  were  composed  of  a  linear 
combination  of  first  order  (velocity)  and  second  order  (acceleration) 
components.  That  is,  the  system  output,  X(t),  is  represented  by  the 
following  equation. 

X(t)  -  [(  1-a)  f  u(t)  dt]  +  [  (  a)  jf'ii  t)  dt] 

where:  u  =*  stick  position;  t  *  time  and  a  *=  difficulty  level. 

The  task  was  conducted  at  two  different  levels  of  the  system  order 
manipulation:  (1)  in  the  first  order  (velocity)  condition  a  was  set  to  zero 
while  (2)  in  the  second  order  (acceleration)  condition  a_  was  set  to  1.0. 
Numerous  investigators  have  validated  the  increasing  difficulty  associated 


Development  of  Automatic  Processing 


Page  13 


with  higher  order  control  (Kramer  et  al.,  1983;  North,  1977;  Trumbo,  Noble 
and  Swink,  1967;  Vidulich  and  Wickens,  1981).  Converging  evidence  employing 
Sternberg's  additive  factors  paradigm  indicates  that  the  demands  of  higher 
order  tracking  are  both  perceptual  and  motor  in  nature,  given  the 
requirement  to  process  higher  derivatives  of  the  error  signal  to  maintain 
stable  control  (Wickens  et  al.,  1980) 

In  the  Sternberg  task,  subjects'  were  instructed  to  decide  if  one  of 
two  letters  presented  on  the  CRT  belonged  to  a  previously  memorized  set  of 
letters.  A  match  will  henceforth  be  referred  to  as  a  positive  trial  while  a 
mismatch  will  be  labeled  as  a  negative  trial.  Each  set  of  thirty  trials 
began  with  a  six  sec  presentation  of  a  memory  set  of  either  two,  three  or 
four  letters.  In  the  30  trials  that  followed  the  presentation  of  each 
memory  set,  the  subjects'  task  was  to  deflect  a  joystick  in  one  direction  if 
one  of  the  two  probe  items  matched  an  item  from  the  memory  set  and  in  the 
opposite  direction  if  neither  of  the  letters  were  from  the  memory  set.  The 
joystick  was  manipulated  with  the  left  hand.  The  direction  of  the 
deflection  of  the  joystick  for  the  two  responses  was  counterbalanced  across 
subjects.  The  two  probe  items  were  presented  simultaneously  for  a  duration 

of  200  msec.  The  ISI  was  3000  msec.  Subjects  were  given  1500  msec  to 

indicate  their  response.  Responses  prior  to  150  msec  and  after  1500  msec 
following  stimulus  onset  were  scored  as  incorrect.  Instructions  emphasized 
both  speed  and  accuracy. 

Two  variables  served  as  blocking  factors  within  the  Sternberg  task. 

First,  subjects  performed  the  task  in  both  CM  and  VM  conditions.  In  the  CM 

condition,  targets  were  always  selected  from  one  set  of  letters  (G,J,N,X) 
while  distractors  were  selected  from  another  set  of  letters 
( P ,H ,Z , B , F ,D , V ,T) .  In  the  VM  condition  both  targets  and  distractors  were 

1  .   
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chosen  from  the  same  set  of  letters  ( P ,H , 1 , B , F ,D , V ,T) .  Targets  and 
distractors  exchanged  roles  over  trials  in  the  VM  condition.  The  second 
blocking  factor  was  the  number  of  items  in  the  memory  set.  Subjects 
performed  the  cask  with  either  two,  three  or  four  memory  set  items.  A  third 
factor,  the  probability  of  a  positive  or  negative  trial  was  fixed  at  .50  in 
each  block.  On  a  positive  trial,  one  of  the  items  was  a  target  and  the 
other  was  a  distractor.  The  targets  occurred  equally  often  on  the  left  and 
right.  On  a  negative  trial,  both  of  the  items  were  distractors.  The 
Sternberg  probes  were  presented  within  a  rectangular  frame  that  also  served 
as  the  target  in  the  Cracking  task. 

In  the  dual-task  blocks  subjects  concurrently  performed  the  tracking 
task  and  the  Sternberg  task.  Pursuit  step  tracking  was  defined  as  the 
primary  task.  Thus,  in  these  conditions  subjects  were  required  to  encode  a 
set  of  memory  items,  respond  to  the  presentation  of  the  probes  and  minimize 
tracking  error.  Subjects  employed  the  left  joystick  for  their  discrete 
responses  in  the  Sternberg  task  and  the  right  joystick  for  their  continuous 
responses  in  the  tracking  task.  Following  each  block  of  trials  the  subjects 
were  informed  of  their  RT  and  accuracy  in  the  Sternberg  task  and  the  average 
root  mean  square  (RMS)  tracking  error. 


Insert  Figure  2  about  here 

The  temporal  sequence  of  the  trials  in  the  dual-task  conditions  is 
graphically  illustrated  in  Figure  2.  The  sequence  proceeded  as  follows: 

The  subjects  began  tracking  changes  in  the  spatial  position  of  the  target. 
Two  spatial  changes  in  the  target  occurred  with  an  ISI  of  1.5  sec.  At  this 
time,  the  Sternberg  memory  set  was  presented  for  6  sec  in  the  center  of  the 
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CRT,  above  the  tracking  task.  Following  presentation  of  the  memory  set,  the 
changes  in  the  spatial  position  of  the  tracking  target  alternated  with  the 
occurrence  of  the  Sternberg  probes.  Thus,  the  IS1  between  events  within 
each  task  was  3.0  sec  while  the  ISI  between  Sternberg  probes  and  the 
displacement  of  the  tracking  target  was  1.5  sec.  After  the  occurrence  of  60 
events,  another  memory  set  was  presented,  followed  by  another  30  Sternberg 
probes  and  30  changes  in  the  position  of  the  target  in  the  tracking  task. 
Thus,  each  dual-task  block  was  composed  of  two  different  Sternberg  memory 
sets,  60  presentations  of  Sternberg  probes  and  62  changes  in  the  position  of 
the  target  in  the  tracking  task.  Dual-task  blocks  lasted  approximately  200 
sec.  Subjects  continuously  performed  the  tracking  task  during  each  dual 
task  block. 

ERP  Recording  System 

Elect roencephalog raphic  activity  (EEG)  was  recorded  from  three  midline 
sites  (Fz,  Cz  and  Pz  according  to  the  International  10-20  system:  Jasper, 
1958)  and  referred  to  linked  mastoids.  Two  ground  electrodes  were  positioned 
on  the  left  side  of  the  forehead.  Beckman  Biopotential  electrodes  affixed 
with  Grass  paste  were  used  for  scalp,  mastoid  and  ground  recording.  Beckman 
electrodes,  affixed  with  adhesive  collars,  were  also  placed  below  and  supra- 
orbitally  to  the  right  eye  to  record  electro-oculogram  (EOG).  Electrode 
impedances  did  not  exceed  10  kohms. 

The  EEG  and  E0C  were  amplified  with  Van  Gcgh  model  50000  amplifiers 
(time  constant  10  sec  and  upper  half  amplitude  of  35  Hz,  3dB  octave 
roll-off).  Both  EEG  and  EOG  were  sampled  for  1300  msec,  beginning  100  msec 
prior  to  stimulus  onset.  The  data  was  digitized  every  10  msec.  ERP's  were 
filtered  off-line  (-3dB  at  6.29  Hz,  OdB  at  14.29Hz)  prior  to  statistical 
analysis.  Evaluation  of  each  EOG  record  for  eye  movements  and  blinks  was 


Development  of  Automatic  Processing 


Page  16 


conducted  off-line.  EOG  contamination  of  EEC  traces  was  compensated  for 
through  the  use  of  an  eye  movement  correction  procedure  (Gratton,  Coles  and 
Donchin,  1982). 

Des  ign 

A  repeated  measures,  five  way  factorial  design,  was  employed.  The 
factors  were  task  level  (single  task  Sternberg,  Sternberg  with  first  order 
tracking,  and  Sternberg  with  second  order  tracking),  the  structure  of  the 
Sternberg  task  (CM  or  VM),  memory  set  size  (2,  3,  or  A),  session  (1  and  10), 
and  the  type  of  Sternberg  trial  (positive  or  negative).  The  first  four 
variables  served  as  blocking  factors,  the  fifth  factor  was  varied  within 
blocks.  Subjects  also  performed  single  task  tracking  blocks  with  first  and 
second  order  control  dynamics. 

Procedure 

Each  of  the  seven  subjects  participated  in  all  of  the  experimental 
conditions.  Ten  experimental  sessions,  within  a  three  week  period,  were 
required  to  complete  the  experiment.  In  sessions  one  and  six  through  ten, 
subjects  performed  two  replications  of  the  dual  and  four  replications  of  the 
single  task  conditions.  Thus,  in  these  sessions  subjects  completed  2A  dual¬ 
task  blocks  comprised  of  120  trials  each  and  32  single  task  blocks  each 
composed  of  60  trials.  Dual-task  blocks  took  approximately  3.5  min  each 
while  single  task  conditions  required  1.5  min  each.  Subjects  were  permitted 
to  take  brief  rest  breaks  between  each  of  the  blocks  and  longer  breaks 
whenever  necessary.  Each  of  these  sessions  lasted  approximately  2.5  hours. 

In  sessions  two  through  five,  subjects  performed  five  replications  of 
the  six  single  task  Sternberg  conditions  (task  structure  x  set  size).  Thus, 
during  these  sessions  subjects  completed  30  blocks  of  trials,  each  requiring 
approximately  1.5  min.  These  sessions  lasted  1  hour  each.  The  order  of 
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presentation  of  experimental  blocks  was  counterbalanced  across  subjects  in 
each  of  the  sessions  in  a  latin  square  design.  Subjects  performed  20,160 
Sternberg  probe  trials  over  the  ten  sessions,  10,080  each  in  the  CM  and  VM 
conditions.  The  tracking  task,  in  both  single  and  dual-task  conditions,  was 
performed  for  approximately  nine  hours. 

RT  and  accuracy  measures  were  recorded  in  the  Sternberg  task  throughout 
the  ten  sessions.  RMS  tracking  error  was  recorded  in  session  one  and  six 
through  ten.  ERPs  were  recorded  during  the  presentation  of  the  Sternberg 
probes  in  the  first  and  last  sessions. 

Data  Analysis 

RT  in  the  Sternberg  task  was  defined  as  the  interval  between  the 
appearance  of  the  probes  and  the  subject's  keypress.  RMS  error  in  the 
tracking  task  was  calculated  every  50  msec  during  single  and  dual-task 
conditions.  These  data  were  averaged  off-line  according  to  experimental 
condition. 

The  single-trial  ERPs  acquired  during  single  and  dual-task  Sternberg 
performance  were  averaged  separately  for  each  of  the  experimental  conditions 
in  sessions  l  and  10.  Each  of  the  single-subject  averages  was  composed  of 
at  least  50  single  trial  ERPs.  The  amplitude  and  latency  of  the  N200  and 
P300  components  were  quantified  in  the  following  manner.  On  each 
single-trial  P300  amplitude  was  measured  as  the  difference  between  the 
maximum  positive  deflection  between  300  and  900  msec  after  the  presentation 
of  the  probes  and  the  baseline,  that  was  defined  as  the  average  voltage 
recorded  over  the  100  msec  epoch  just  preceeding  the  stimulus  array  (Coles, 
Gratton,  Kramer  and  Miller,  1986).  The  latency  was  defined  as  the  time  at 
which  the  P300  reached  its  maximum  amplitude.  Since  the  signal  to  noise 
ratio  for  the  N200  component  is  smaller  than  that  for  the  P300,  the 
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amplitude  and  latency  of  the  N200  were  measured  on  the  average  waveforms  for 
each  of  the  experimental  conditions.  The  N200  was  defined  as  the  maximum 
negative  deflection  occurring  between  200  and  450  msec  post-probe.  Due  to 
the  enormity  of  the  data  base  in  the  present  study,  analyses  of  P300  and 
N200  measures  will  be  restricted  to  the  Pz  and  Cz  sites,  respectively. 

Results  and  Discussion 

This  section  is  organized  in  the  following  manner.  First,  the  results 
for  both  of  the  single  tasks  will  be  presented  for  the  first  and  last 

experimental  sessions.  This  is  done  to  demonstrate  that  we  have  replicated 

the  effects  of  task  structure  and  practice  on  measures  of  performance  and  to 
illustrate  how  psychophysiological  measures  can  be  used  to  explicate  changes 
in  cognitive  processing.  Second,  we  provide  a  comparison  of  the  results 
obtained  in  the  single  and  dual  task  conditions,  contrasting  the  transition 

from  single  to  dual  tasks  as  well  as  examining  the  effects  of  an  increase  in 

the  difficulty  of  the  tracking  task  on  Sternberg  performance.  Several 
dependent  measures  are  examined  in  both  the  single  and  dual-task  analyses. 
These  include:  RMS  error  in  the  tracking  task,  reaction  time  (RT),  accuracy, 
measures  of  the  amplitude  and  latency  of  the  N200  and  P300  components  of  the 
ERP  and  the  RT/P300  ratio  in  the  Sternberg  task. 


Insert  Figure  3  about  here 


Performance  measures 


Single  Tasks 


A  five-way  repeated  measures  ANOVA  (subjects  x  session  x  task  structure 
x  memory  set  size  x  response  type)  was  performed  on  the  RT  data.  Figure  3 
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presents  the  average  RTs  for  the  CM  and  VM  conditions  and  the  three  memory 
set  sizes  in  sessions  1  and  10.  As  suggested  by  the  figure,  subjects 
performed  the  Sternberg  task  more  quickly  in  session  10  than  they  did  in 
session  1  (_F(  1  ,6  )=*  19 . 9 ,  £<.01).  RT  was  faster  in  the  CM  than  in  the  VM 
conditions  (_F(  1 ,6)  =  52.0,  £<.01).  The  analysis  also  indicated  a  significant 
three-way  interaction  among  session,  task  structure,  and  memory  set  size 
(_F(2,12)=  10.5,  £<.01),  suggesting  that  memory  set  size  did  not  have  an 
effect  on  RT  in  the  CM  conditions  in  session  10.  This  interpretation  of  the 
three-way  interaction  was  supported  by  a  series  of  post-hoc  comparisons . 1 
In  session  1,  RTs  increased  as  a  function  of  set  size  for  both  CM  (F(l,6)= 
26.7,  £<.01)  and  VM  (_F(  1 ,6)=74 .8  ,  _£<.01)  conditions.  In  session  10,  after 
subjects  had  received  over  20,160  trials  of  practice,  memory  set  size  still 
produced  a  significant  effect  in  the  VM  conditions  (_F( 1 ,6)*76.2,  £<.01). 
However,  in  the  CM  conditions,  the  size  of  the  memory  set  did  not  have  a 
significant  effect  on  RT  (£>.65).  The  differences  in  memory  set  effects 
between  CM  and  VM  conditions  as  a  function  of  practice  are  further  supported 
by  the  memory  set  slopes  obtained  in  a  series  of  regression  analyses 
performed  on  these  conditions.  The  slopes  for  the  CM  positive  and  negative 
conditions  in  sessions  1  and  10  were  39.6,  43.1,  1.3  and  2.1  msec, 
respectively.  The  slopes  for  the  same  VM  conditions  were  53.6,  71.5,  52.6 
and  72.2,  respectively. 

The  pattern  of  RTs  produced  in  the  CM  and  VM  -conditions  is  consistent 
with  previous  findings  and  fulfills  the  "zero  slope"  criterion  for 
automat icity.  (Logan,  1985;  Shiffrin  and  Schneider,  1977;  Shiffrin,  Dumais 
and  Schneider,  1984).  Even  extensive  practice  does  not  improve  memory 
search  performance  when  subjects  are  unable  to  consistently  map  stimuli  to 
responses.  However,  when  subjects  are  able  to  consistently  map  stimuli  to 
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responses,  performance  improves  such  that  the  time  required  to  compare  two 
probes  to  four  items  in  memory  does  not  significantly  differ  from  the  time 
required  to  compare  the  probes  to  two  memory  set  items. 

It  is  important  to  note  that  performance  does  improve  in  the  VM 
conditions  with  practice. 2  However,  this  decrease  in  RT  was  in  the 
intercept  not  the  slope.  In  the  CM  conditions  both  the  intercept  and  memory 
set  size  slope  decreased  with  practice.  This  observation  of  decreased  RT  in 
the  VM  conditions  is  not  new.  In  fact,  it  has  been  hypothesized  that  the 
improvement  in  performance  may  be  attributed  to  familiarization  with  the 
cask  instructions,  equipment  useage,  or  selection  of  strategies  (Ackerman, 
1987)  or  the  automatization  of  consistent  components  of  the  task  (Schneider, 
Dumais,  and  Shiffrin,  1984).  However,  at  the  present  time  these  hypotheses 
remain  untested.  One  strategy  for  examining  these  differences  in 
performance  is  to  analyze  the  changes  in  RT  within  the  framework  of 
Sternberg's  (1966,  1969)  additive  factors  methodology.  Within  such  a 
framework  the  pattern  of  results  obtained  in  the  present  study  would  suggest 
that  either  encoding  and/or  response  processes  become  more  efficient  in  both 
the  VM  and  CM  conditions,  while  the  need  for  memory  comparison  processes 
diminishes  in  the  CM  but  not  in  the  VM  conditions.  The  distinction  between 
encoding  and  response  demands  will  be  addressed  by  the  analysis  of  P300 
latency. 


Insert  Table  1  about  here 


A  further  decomposition  of  the  changes  in  processing  with  practice  can 
also  be  provided  by  an  examination  of  the  response  type  variable.  Although 
we  did  not  explicity  manipulate  the  number  of  display  comparisons,  as  has 
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been  done  in  other  investigations  of  automatic  and  controlled  processing,  we 
did  present  the  target  items  on  both  the  right  and  left  side  of  the  frame. 

On  half  of  the  positive  probe  trials  a  memory  set  item  appeared  on  the  left 
and  a  distractor  on  the  right  while  this  arrangement  was  reversed  on  the 
other  half  of  the  positive  trials.  Since  the  probes  were  presented  within  a 
1.9  degree  visual  angle  for  200  msec,  it  was  unlikely  that  subjects  moved 
their  eyes  to  scan  the  display  for  a  target.  However,  as  is  apparent  from 
Table  1,  subjects  did  appear  to  shift  their  attention  from  the  left  to  the 
right  when  performing  in  the  VM  conditions.  Thus,  it  took  subjects  34  msec 
longer  to  respond  to  the  target  when  it  appeared  on  the  right  than  it  did 
when  the  target  occurred  on  the  left  of  the  frame  (_F(  1  ,6  )«6 . 2  ,  £<.05).  In 
the  CM  conditions,  RT  was  not  influenced  by  the  position  of  the  probe 
(£>.48).  It  is  interesting  to  note  that  this  relationship  between  response 
type  and  task  structure  did  not  interact  with  the  amount  of  practice.  This 
would  imply  that  the  rapid  and  apparently  parallel  display  processing 
strategy  exhibited  in  the  CM  condition  developed  within  the  first  1440 
trials  of  practice  (session  1),  far  more  quickly  than  the  decrease  in  the 
memory  set  slope.  Negative  trials  took  longer  to  respond  to  than  positive 
trials  in  both  CM  and  VM  conditions  (£(  2 , 12)  =  10.22 ,  £<.01). 


Insert  Figure  4  about  here 


Figure  4  presents  the  average  error  rate  for  the  CM  and  VM  conditions 
and  the  three  memory  set  sizes  in  sessions  1  and  10.  The  mean  error  rate 
across  all  of  the  experimental  conditions  was  1.5%.  The  pattern  of  results 
was  quite  similar  to  that  obtained  for  RT.  Subjects  made  significantly  more 
errors  when  performing  in  the  VM  conditions  than  they  did  in  the  CM 
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conditions  (_F(  1 ,6  )  =  3 1 . 2  ,  j><.01).  Error  rate  also  increased  as  a  function  of 
memory  sec  size  (_F(  2 , 12  )  =  7 . 2  ,  £<.01).  A  significant  three-way  interaction 
among  session  x  task  structure  x  set  size  was  also  obtained  ( F( 2 , 1 2 )=5 . 8 , 
£<•05),  suggesting  that  subjects'  error  rate  increased  as  a  function  of 
memory  load  for  both  CM  and  VM  conditions  in  session  1,  but  only  for  VM 
conditions  in  session  10.  Since  higher  error  rates  were  associated  with 
longer  RTs,  these  data  suggest  that  subjects  were  not  trading  speed  for 
accuracy  while  performing  the  Sternberg  task. 

Single  task  tracking  performance  was  evaluated  by  calculating  RMS  error 
for  each  condition  and  submitting  this  data  to  a  three-way  repeated  measures 
ANOVA  (subjects  x  sessions  x  system  order).  Subjects  performed 
significantly  better  with  first  order  dynamics  than  they  did  with  second 
order  dynamics  (_F(  1 ,6)=27 .5  ,  £<.01).  Subjects  tracking  performance  also 
improved  with  practice  ( F( 1  ,6 )= 11 . 85  ,  j><.01).  The  interaction  between  system 
order  and  session  was  not  significant. 


Insert  Figure  5  about  here 


Event-Related  Potentials 

The  ERPs  were  recorded  to  address  several  issues.  First,  since  the 
latency  of  the  P300  component  is  influenced  by  the  duration  of  stimulus 
evaluation  processes  but  is  relatively  unaffected  by  response  selection  and 
execution  processes,  the  joint  use  of  RT  and  P300  latency  were  used  to 
lecompose  the  information  processing  demands  of  the  Sternberg  task  as  a 
•  met  ion  of  task  structure,  practice  and  dual-task  demands.  Second,  the 
is;  ,'ude  of  the  P300  has  been  found  to  vary  with  the  perceptual/cognitive 
-i-  :s  ■>*  a  task.  Therefore,  this  measure  was  employed  to  provide  an 
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estimate  of  the  resource  costs  of  the  tasks,  both  in  isolation  and  when 
combined  in  the  dual-task  conditions.  Finally,  since  the  N200  component 
provides  an  index  of  the  processing  of  mismatches,  we  would  expect  that  it 
will  assist  us  in  explicating  the  effects  of  the  experimental  manipulations 
on  the  target/non-targ et  decision. 

Figure  5  shows  the  grand  average  ERPs,  recorded  at  the  parietal  site, 
for  each  of  the  single  task  Sternberg  conditions.  ERP  components  are 
traditionally  defined  in  terms  of  their  latency  relative  to  a  stimulus  or 
response,  scalp  distribution,  and  sensitivity  to  experimental  manipulations 
(Donchin,  1981;  Kramer,  1985;  Sutton  and  Ruchkin,  1984).  The  large  positive 
going  deflection  in  the  waveforms  became  increasingly  positive  from  the  Fz 
to  the  Pz  electrode  site  (_F(  2 , 12 )  =  34 . 2 ,  j><.01)  and  the  base  to  peak  measures 
were  maximal  between  350  and  800  msec  post-stimulus.  Based  on  these 
criteria  this  positive  deflection  can  be  identified  as  the  P300.  The 
negative  going  deflection  preceeding  the  P300  increased  in  negativity  from 
the  parietal  to  frontal  site  (F(2,12)=22.4,  j><.01)  and  was  maximal  in 
amplitude  between  200  and  400  msec  post-stimulus.  This  component  will  be 
referred  to  as  the  N200. 


Insert  Figure  6  about  here 


P300  component  The  mean  P300  latency  values  obtained  in  each  of  the 
single  task  conditions  are  presented  in  Figure  6.  The  pattern  of  latencies 
in  the  positive  VM  conditions  is  quite  similar  to  that  found  for  RT.  The 
latency  of  the  P300  increases  as  a  function  of  memory  set  size  in  both 
sessions  1  and  10.  The  pattern  of  P300  latencies  obtained  in  the  positive  CM 


condition  is  also  quite  similar  to  the  pattern  of  RTs 


In  session  I ,  P300 
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latency  increased  from  the  set  size  2  to  set  size  3  condition.  In  session 
10,  the  effect  of  memory  set  size  on  P300  latency  was  greatly  reduced. 

These  differences  were  quantified  in  a  five-way  repeated  measures  ANOVA 
(subjects  x  session  x  task  structure  x  memory  set  size  x  response  type).  As 
suggested  by  the  figure,  P300  latency  was  significantly  shorter  in  the  CM 
conditions  than  it  was  in  the  VM  conditions  (£(  1 , 6 )»27 . 1 ,  £<.01).  A 
significant  interaction  between  session,  tasks  structure  and  memory  set  size 
suggests  that  the  memory  set  slope  in  the  CM  condition  decreased  from  the 
first  to  the  last  session  while  the  slope  obtained  in  the  VM  conditions  was 
not  influenced  by  practice  (£(  2 , 12 )»4 . 9 ,  £<.05).  This  interpretation  of  the 
three-way  interaction  was  supported  by  a  series  of  post-hoc  comparisons. 
Significant  memory  set  effects  were  obtained  for  the  CM  condition  in  session 
1  and  the  VM  conditions  in  sessions  1  and  10  (p<.05).  The  memory  set  effec: 
was  not  significant  for  the  CM  condition  in  session  10  (p>.70).  The  slopes 
for  the  CM  positive  conditions  in  sessions  1  and  10  were  14.6  and  .28.  For 
the  VM  conditions  the  slopes  were  22.6  and  27.2. 

The  pattern  of  P300  latencies  in  the  positive  CM  and  VM  conditions  is 
consistent  both  with  the  pattern  of  RT  results  obtained  in  the  present  study 
and  the  reduced  P300  slopes  found  in  practiced  CM  conditions  in  previous 
studies  (Hoffman,  Simmons,  and  Houck,  1983;  Hoffman  et  al. ,  1985;  Kramer  et 
al.,  1986).  It  is  interesting  to  note,  however,  that  even  in  session  1  the 
slope  in  the  CM  condition  is  flat  at  the  higher  memory  set  sizes.  This 
finding  is  consistent  with  phase  2  processing  in  Schneider's  (1985)  model  of 
automaticity.  In  phase  2  of  the  model,  automatic  and  control  processes 
co-occur  and  the  relatively  weak  automatic  processing  finishes  before 
controlled  processing  only  when  many  controlled  processing  comparisons  must 
be  made.  Thus,  there  should  be  a  flattening  of  the  slope  for  the  higher 
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memory  set  sizes.  The  finding  that  P300  latency  and  not  RT  shows  this 
flattening  early  in  practice  suggests  that  the  component  of  stimulus 
evaluation  processing  influenced  by  the  memory  load  manipulation  becomes 
automated  prior  to  the  components  of  response  related  processing  that  are 
affected  by  memory  load  (Marcel,  1976). 

Evidence  for  additional  differences  in  processing  as  a  function  of  task 
structure  is  provided  in  Table  1.  Consistent  with  the  pattern  of  RT's, 

P300s  elicited  by  the  VM  targets  presented  on  the  left  side  of  the  display 
were  29  msec  faster  than  those  elicited  by  targets  presented  on  the  right 
side  of  the  display  (£(  1 ,6 )  =  10 . A  ,  _£<.01).  However,  the  P300s  elicited  in 
the  CM  conditions  were  uninfluenced  by  the  position  of  the  target  in  the 
display  (p>.35).  The  relationship  between  response  type  and  task  structure 
did  not  interact  with  the  amount  of  practice.  Thus,  the  P300  results  when 
viewed  in  conjunction  with  RT  suggest  that  the  probes  were  processed  in 
parallel  in  the  CM  conditions  while  in  the  VM  conditions  it  appears  that 
subjects  serially  processed  the  two  items.  Furthermore,  since  the  magnitude 
of  the  differences  in  RT  and  P300  latency  were  essentially  the  same  in  the 
VM  conditions,  the  serial  processing  strategy  cannot  be  attributed  to 
differences  in  response  criterion.  In  both  VM  and  CM  conditions  negative 
trials  elicited  later  P300s  than  positive  trials  (_F  ( 2,12)“7 .2 ,  j><.01). 

An  analysis  of  the  performance  data  indicated  that  the  RT  intercept  for 
the  memory  set  size  function  significantly  decreased  for  both  the  CM  and  VM 
conditions  from  session  1  to  10.  Within  the  framework  of  Sternberg's 
additive  factors  logic  this  suggests  more  efficient  utilization  of  either 
encoding  and/or  response  processes.  Given  that  P300  latency  is  primarily 
sensitive  to  stimulus  evaluation  processes,  the  joint  examination  of  RT  and 
P300  latency  intercepts  can  aid  in  the  localization  of  this  practice  effect. 
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The  P300  latency  intercept  was  uninfluenced  by  the  level  of  practice 
(p>.50).  This  pattern  of  results  suggests  that  the  reduced  RT  intercepts 
obtained  in  the  CM  and  VM  conditions  after  practice  can  be  attributed  to  a 
reduction  in  the  time  required  to  complete  response  processes. 

Although  the  P300  latencies  elicited  by  the  target  stimuli  were 
consistent  with  the  pattern  of  RTs ,  the  negative  or  non-target  P300s  showed 
a  different  pattern  of  results  (see  figure  6).  A  significant  four-way 
interaction  among  session,  task  structure,  memory  set  size  and  response  type 
(£(4 ,24 )  =  1 1 . 2 ,  £<.01)  suggested  that  the  memory  set  effect  was  diminished  in 
the  negative  conditions.  Post-hoc  comparisons  supported  this  observation. 
The  memory  set  effect  was  not  significant  in  any  of  the  negative  conditions. 
The  diminished  memory  set  effect  in  negative  conditions  has  also  been  found 
in  other  studies  (Adams  and  Collins,  1978;  Pfefferbaum  et  al. ,  1985;  Strayer 
et  al.,  1987;  van  Dellen  et  al.,  1985).  This  insensitivity  of  P300  latency 
to  the  set  size  manipulation  may  be  the  result  of  a  deadline  strategy  in  the 
negative  conditions. 

The  differences  among  the  mean  P300  amplitudes  obtained  in  each  of  the 
single  task  conditions  were  quantified  in  a  five-way  repeated  measures  ANOVA 
(subjects  x  sessions  x  response  type  x  memory  set  size  x  task  structure). 
Consistent  with  previous  studies  (Ford  et  al.,  1980;  Hoffman,  Simmons,  and 
Houck,  1983;  Kramer  et  al.,  1986),  P300s  elicited  by  the  targets  were 
significantly  larger  than  P300s  elicited  by  non-cargets  (F(2 , 12)*18.2 , 
jd<.01).  No  other  main  effects  or  interactions  were  significant. 


Insert  Figure  7  about  here 
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RT/P30Q  ratio  A  single  trial  ratio  of  RT  to  P300  latency  was 
computed  to  determine  changes  in  the  relative  proportion  of  post-stimulus 
evaluation  processing  during  learning.  Figure  7  presents  the  mean  RT/P300 
ratios  for  each  of  the  single  task  conditions.  A  referent  value  in  the 
figure  is  the  solid  horizontal  line  drawn  at  the  RT/P300  ratio  of  1.0.  This 
ratio  reflects  the  co-occurrence  in  time  of  the  P300  peak  latency  and  the  RT 
response.  Values  larger  than  1.0  indicate  that  RT  was  preceeded  by  P300 
while  values  less  than  1.0  indicate  that  P300  followed  RT. 

The  effects  of  experimental  manipulations  on  the  RT/P300  ratio  were 
quantified  in  a  five-way  repeated  measures  ANOVA  (subjects  x  session  x  task 
structure  x  memory  set  size  x  response  type).  As  suggested  by  figure  7,  the 
RT/P300  ratio  decreased  with  practice  in  both  the  CM  and  VM  conditions 
(_F( 1 ,6)=37.9,  jK.01).  In  the  CM  and  VM  conditions  in  session  1  the  RT/P300 
ratios  were  equal  to  or  greater  than  1.0.  However,  in  session  10  all  of  the 
CM  conditions  and  the  smaller  set  sizes  in  the  VM  conditions  produced  ratios 
of  less  than  1.0.  A  significant  two-way  interaction  between  session  and 
task  structure  (£(  1 ,6)-35.0,  £<.01)  indicated  that  the  reduction  in  the 
RT/P300  ratio  was  larger  in  the  CM  than  the  VM  conditions.  These  results 
have  several  important  implications.  First,  they  suggest  that  the 
extraction  of  perceptual  information  becomes  more  efficient  following 
practice.  Subjects  are  capable  of  emitting  faster  and  more  accurate 
responses  on  the  basis  of  less  evaluation  of  the  .stimulus  array.  Second,  the 
smaller  RT/P300  ratios  in  the  practiced  CM  conditions  suggest  that  the 
process  of  information  extraction  is  most  efficient  following  consistent 
training.  LaBerge  (1981)  has  described  this  process  of  efficient  perceptual 
encoding  as  within  system  automaticity. 


A  significant  effect  of  response  type  indicated  that  non-targets 
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required  more  post-stimulus  evaluation  processing  than  targets  ( F( 1 ,6 )=38 .4  , 
£<•01 ).  However,  the  difference  between  targets  and  non-targets  decreased 
with  practice,  especially  in  the  CM  condition  (JF(  1 ,6)«*7 .2  ,  £<.05).  A 
decrease  in  the  RT/P300  ratio  memory  set  size  effect  was  also  obtained  when 
subjects  received  substantial  practice  on  the  task  (j[(2 , 12)=9.6 ,  £<«01). 

The  increased  RT/P300  ratio  with  larger  set  sizes  and  non-targets  has 
previously  been  described  in  terms  of  the  resolution  of  uncertainty  that 
results  from  a  re-checking  of  memory.  It  has  been  asserted  that  this 
process  of  re-checking  occurs  subsequent  to  the  elicitation  of  P300  and 
therefore  results  in  a  lengthening  of  RT  in  the  absence  of  changes  in  P300 
latency  (Ford  et  al.,  1979).  Assuming  that  such  a  re-checking  process 
occurs,  our  results  indicate  that  practice,  and  especially  practice  that 
leads  to  the  development  of  automat icity ,  reduces  the  uncertainty  of  the 
memory  comparison  process  (see  also  Kramer  et  al.,  1986). 

N2Q0  component  The  mean  N200  latency  and  amplitude  values  recorded 
at  CZ  were  quantified  in  five-way  repeated  measures  ANOVAs  (subjects  x 
sessions  x  response  type  x  task  structure  x  memory  set  size).  The  N200's 
elicited  by  the  non-targets  were  significantly  larger  than  the  N200's 
elicited  by  the  targets  (£(1,6)»6.4,  £<.05).  This  result  is  consistent  with 
a  large  body  of  literature  which  has  found  that  N200s  are  larger  for 
mismatches  than  they  are  for  matches  (Naatanen,  Simpson  and  Loveless,  1982; 
Naatanen  and  Gaillard,  1983;  Ritter  et  al.,  1984).  The  only  other 
significant  effect  for  N200  amplitude  was  session  (_F(1 ,6)»13.8,  £<.01). 
N200's  were  larger  in  session  10  than  they  were  in  session  1.  This  effect 
may  be  attributed  to  a  reduction  in  the  latency  variability  of  subjects' 
performance  and  ERP  components  with  extended  practice  on  the  Sternberg  task. 
It  is  interesting  to  note  that  task  structure  did  not  influence  the 
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amplitude  of  the  N200's  elicited  by  the  stimuli.  Thus,  consistent  with 
previous  findings  (Kramer  et  al.f  1986)  this  pattern  of  results  suggests 
that  the  mismatch  detection  processes  reflected  by  N200  do  not  differ  in 
automatic  and  nonautomatic  tasks.  No  significant  effects  were  obtained  for 
N’200  latency. 

Dual-Task  Interactions 

The  results  presented  in  this  section  will  provide  a  comparison  of 
performance  measures  and  ERP  components  obtained  in  the  single  and  dual  task 
conditions,  contrasting  the  transition  from  single  to  dual  tasks  as  well  as 
examining  the  effects  of  an  increase  in  the  difficulty  of  the  tracking  task 
on  Sternberg  performance.  Since  the  previous  section  has  already  dealt  with 
the  significant  effects  obtained  within  the  single  tasks,  the  current 
section  will  be  confined  to  a  presentation  of  those  effects  that  interact 
with  task  level  (e.g.,  single  task,  dual  task  easy  tracking,  dual  task 
difficult  tracking).  This  section  will  be  organized  in  the  same  format  as 
the  single  task  section.  Analyses  will  include  Sternberg  RT  and  error  data, 
RMS  tracking  error,  measures  of  the  amplitude  and  latency  of  the  P300  and 
N200  components  and  the  RT/P300  ratio  in  the  Sternberg  task. 


Insert  Figures  8  and  9  about  here 


Performance  measures 

Figures  8  and  9  present  the  mean  RT's  obtained  in  the  Sternberg  task 
when  performed  in  conjunction  with  the  first  order  and  second  order  versions 
of  the  Step  Tracking  task.  The  pattern  of  RTs  obtained  in  the  dual  task 
conditions  are  quite  similar  to  that  obtained  in  the  single  task  Sternberg 
condition.  RTs  increased  with  set  size  for  the  VM  and  CM  conditions  in 
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session  1  and  the  VM  conditions  in  session  10.  Decreases  in  the  intercept 
of  the  set  size  function  can  be  observed  for  both  the  CM  and  VM  conditions 
with  practice.  The  RTs  for  the  nontarget  responses  were  longer  than  those 
for  the  target  responses. 

A  six-way  repeated  measures  ANOVA  (subjects  x  task,  level  x  session  x 

task  structure  x  memory  set  size  x  response  type)  was  performed  on  the  RT 

data.  A  significant  main  effect  was  obtained  for  task  level  ( F(  2 , 12 )=4 . 5  , 
_p<.05).  Post-hoc  comparisons  indicated  that  RT  increased  both  with  the 
introduction  of  the  tracking  task  and  with  an  increase  in  its  difficulty 
(p's  <.05).  A  significant  interaction  between  task  level  and  memory  set 
size  was  also  obtained  (_F(  4 , 24)=4 . 9  ,  j><.01).  Post-hoc  comparisons  indicated 
that  the  interaction  was  due  to  the  reduction  in  memory  set  slope  from  the 
first  to  the  second  order  tracking  condition. 

The  finding  of  an  increase  in  RT  with  the  introduction  of  the  tracking 

task  and  an  increase  in  its  difficulty  when  viewed  in  conjunction  with  the 

decrease  in  the  memory  set  slope  from  the  first  to  the  second  order  dual 
task  conditions  suggests  that  the  main  effect  of  task  level  may  be  due  to  an 
increase  in  the  memory  set  intercept.  This  hypothesis  was  confirmed  by  an 
analysis  of  the  memory  set  intercepts.  The  intercept  increased 
significantly  from  the  single  task  to  the  dual  first  order  condition  as  well 
as  from  the  dual  first  order  to  the  dual  second  order  condition  (p's  <.05). 
Thus,  consistent  with  previous  findings  the  locus  of  the  interaction  of  the 
two  tasks  appears  to  be  during  either  encoding  and/or  response  selection 
(Trumbo  et  al.,  1967  ;  Vidulich  and  Wickens,  1981.;  Wickens  et  al.,  1980).  An 
analysis  of  P300  latency  will  allow  us  to  distinguish  between  an  encoding 
and  response  demand  interpretation  of  this  effect. 
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The  lack  of  an  interaction  of  task  level  with  the  structure  of  the  task 
(CM  and  VM  conditions)  and  the  amount  of  practice  underscores  the  importance 
of  a  careful  task  analysis.  Thus,  even  after  20,000  trials  of  practice  and 
the  attainment  of  the  "zero  slope"  criterion  of  automaticity ,  subjects' 
response  speed  in  the  CM  and  VM  conditions  was  similarly  affected  by  dual 
task  manipulations.  This  pattern  of  results  provides  additional  support  for 
the  assertion  that  it  is  not  tasks  that  become  automatic  but  instead  task 
components  or  processes  (Jonides  et  al.,  1985;  Logan,  1985;  Shiffrin,  Dumais 
and  Schneider,  1984). 

The  memory  comparison  process  reflected  by  the  memory  set  slope  was 
not  adversely  affected  by  the  introduction  of  the  tracking  task  or  an 
increase  in  its  difficulty.  In  fact,  an  underadditive  interaction  between 
memory  set  size  and  task  level  was  obtained  suggesting  an  increased  overlap 
in  the  memory  comparison  and  response  processes  in  the  difficult  dual  task 
conditions  (Pashler,  1984;  Stanovich  and  Pachella,  1977).  Thus,  in  the 
present  study  the  use  of  dual  task  methodology  has  allowed  us  localize  the 
development  of  automaticity  to  specific  task  components  (see  also  Logan, 
1978,  1979). 


Insert  Table  2  about  here 


Table  2  presents  the  average  error  rate  for  the  dual  task  Sternberg 
conditions.  The  pattern  of  errors  is  quite  similar  to  that  obtained  in  the 
single  task  conditions.  Error  rates  were  higher  in  the  CM  than  in  the  VM 
conditions.  Errors  increased  with  memory  set  size  while  decreasing  with 
practice.  A  main  effect  of  task  level  was  obtained  (_F(2,12)»8.0,  £<.01). 


Post-hoc  comparisons  indicated  that  the  error  rate  increased  with  the 
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introduction  of  the  tracking  task  and  an  increase  in  its  difficulty 
(p's<.05).  A  significant  two-way  interaction  between  task  level  and  session 
( _F(  2 , 12  )  =  1 1 .6  ,  £<.01)  indicated  that  the  task  level  difference  decreased  as 
subjects  received  practice  on  the  tasks. 

The  pattern  of  RMS  error  in  the  tracking  task  was  similar  to  that 
obtained  in  the  single  task  condition.  Subjects  performed  better  with  first 
order  dynamics  that  they  did  with  second  order  dynamics.  RMS  error 
diminished  with  practice.  Tracking  performance  was  unaffected  by  the 
transition  from  single  to  dual  task  conditions  and  the  levels  of  memory  load 
in  the  Sternberg  task.  Thus,  subjects  protected  their  "primary"  task 
performance  at  the  expense  of  their  performance  on  the  "secondary"  task. 

This  pattern  of  results  is  necessary  in  order  to  interpret  secondary  task 
decrements  (Sternberg  task)  in  terms  of  primary  task  (tracking  task)  demands 
(Wickens,  1984). 


Insert  Figures  10  and  11  about  here 


Event-Related  Potentials 

The  grand  average  ERPs  recorded  at  Pz  in  the  dual  task  conditions  are 
presented  in  Figures  10  and  11.  Visual  inspection  of  the  waveforms  suggests 
that  the  dual  task  ERPs  are  similar  in  morphology  to  those  elicited  in  the 
single  task  conditions.  The  waveforms  are  characterized  by  a  series  of 
negative  and  positive  peaks  with  a  large  amplitude  positivity  occurring  in 
the  range  of  450  to  700  msec  post-stimulus.  The  scalp  distribution  of  the 
N200  and  P300  components  was  also  equivalent  to  that  obtained  in  the  single 
task  conditions.  The  N200  increased  in  negativity  from  the  parietal  to  the 
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frontal  site  while  the  P300  became  increasingly  positive  from  frontal  to  the 
parietal  site. 


Insert  Table  3  about  here 


P300  component  The  mean  P300  latencies  obtained  at  the  Pz  recording 
site  in  each  of  the  dual  task  conditions  are  presented  in  Table  3.  As  can 
be  seen  by  a  comparison  of  the  table  with  the  single  task  P300  latencies  in 
Figure  6,  the  single  task  effects  of  task  structure,  memory  set  size, 
practice,  and  response  type  are  also  found  in  the  dual  task  conditions.  The 
only  significant  effect  of  task  level  was  an  increase  in  the  intercept  of 
the  memory  set  function  from  the  single  task  Sternberg  condition  to  the  dual 
task  condition  ( F( 2 , 1 2 )»3 . 9 ,  £<.05).  The  Intercept  did  not  increase  further 
when  the  difficulty  of  the  tracking  task  was  increased  from  first  to  second 
order  control  dynamics.  These  results  serve  to  clarify  the  locus  of  the 
increase  in  the  RT  intercept  with  the  introduction  of  the  tracking  task  and 
an  increase  in  its  difficulty.  Given  that  P300  latency  is  sensitive  to 
factors  that  influence  stimulus  evaluation  processes  and  relatively 
insensitive  to  response  factors,  the  pattern  of  P300  latency  and  RT  results 
suggest  that  the  imposition  of  the  tracking  task  influenced  both  encoding 
and  response  processes  while  the  increase  in  tracking  difficulty  had  its 
primary  effect  on  response  processes. 

An  analysis  of  P300  amplitudes  was  undertaken  to  determine  the  degree 
to  which  the  demands  imposed  upon  the  subjects  by  the  imposition  of  the 
tracking  task  and  an  increase  in  its  difficulty  would  be  reflected  in  the 
resources  allocated  to  the  Sternberg  task.  P300s'  elicited  in  the  Sternberg 
task  prior  to  extensive  practice  decreased  in  amplitude  with  the 
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introduction  of  the  tracking  task  and  an  increase  in  its  difficulty 
(_F(  2 , 1 2  )=9 .4  ,  £<.01).  However,  P300  amplitude  was  not  influenced  by  task 
level  after  25  hours  of  practice  on  the  single  and  dual  tasks.  These 
results,  interpreted  within  a  resource  theory  framework,  suggest  that  the 
resource  requirements  of  the  tracking  task  decreased  with  extensive  practice 
such  that  the  resource  demands  reflected  by  P300  amplitude  were  equivalent 
in  the  first  and  second  order  tracking  conditions. 

At  first  glance  this  interpretation  may  seem  to  be  at  odds  with  the 
finding  that  although  tracking  performance  improved  with  practice,  RMS  error 
was  still  significantly  larger  for  the  second  order  than  it  was  for  the 
first  order  condition  after  practice.  However,  previous  analyses  of  the 
resource  requirements  of  system  order  manipulations  have  indicated  that  they 
consume  both  perceptual  and  response  related  resources  (Trumbo  et  al.,  1967; 
Vidulich  and  Wickens,  1981).  Thus,  the  pattern  of  RMS  error  and  P300 
amplitude  suggests  that  although  the  response  related  demands  of  the 
tracking  task  may  have  remained  relatively  constant  over  practice,  the 
perceptual  demands  have  diminished  (see  Kramer  et  al.,  1983  for  further 
support ) . 

RT/P30Q  ratio  The  mean  RT/P300  ratios  obtained  in  the  dual  task 
conditions  are  presented  in  Table  4.  As  can  be  seen  from  the  table,  the 
major  effects  obtained  in  the  single  task  conditions  were  replicated  in  the 
dual  task  conditions.  RT/P300  ratio  decreased  with  practice  for  both  CM  and 
VM  tasks,  although  this  effect  was  largest  in  the  CM  conditions.  The  ratios 
were  larger  for  the  nontargets  than  they  were  for  the  targets  especially  in 
the  VM  conditions.  A  significant  two-way  interaction  between  task  level  and 
session  was  obtained  (£( 2 , 12 )»10 . 3 ,  p<.0l).  Post-hoc  comparisons  indicated 
that  this  effect  could  be  attributed  to  an  increase  in  the  RT/P300  ratio  in 
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the  second  order  conditions.  Thus,  the  RT/P300  ratios  recorded  on  the 
single  trials  are  consistent  with  the  pattern  of  results  obtained  for  the 
mean  RT's  and  P300  latencies.  Both  sets  of  analyses  suggest  that  the  time 
required  for  response  processes  increased  in  the  difficult  dual  task 
condition. 

N200  component  The  only  significant  single/dual  task  effect  for  the 
N200  component  was  a  main  effect  of  task  level  (_F(  2  , 1 2 )  =■  1 2 . 2  ,  _£<.01).  Post- 
hoc  comparisons  indicated  that  this  was  due  to  a  reduced  N200  amplitude  in 
the  second  order  conditions.  Since  this  is  the  condition  in  which  P300 
amplitude  was  also  smallest  it  is  conceivable  that  the  reduction  in  N200 
amplitude  was  due  to  increased  latency  variability  in  this  condition. 
Although  it  was  impossible  to  evaluate  the  latency  variability  of  the  N200 
since  measures  of  its  amplitude  and  latency  were  obtained  from  averages,  an 
analysis  of  both  P300  and  RT  single  trial  measures  indicated  an  increased 
latency  variability  in  the  second  order  conditions. 

Conclusions 

The  present  study  had  two  main  goals;  to  elucidate  the  acquisition 
rates  of  several  of  the  properties  of  automatic  processing  and  to  examine 
the  degree  to  which  automatic  processing  can  be  localized  to  particular 
components  of  the  search  task.  These  questions  were  addressed  by  a  combined 
methodological  approach  which  included  ERP  components,  dual-task 
manipulations  and  additive  factors  logic. 

Schneider's  (1985)  recent  extension  of  the  two-process  theory  of 
automatic  and  controlled  processing  (Schneider  and  Shiffrin,  1977;  Shiffrin 
and  Schneider,  1977)  can  be  used  as  a  framework  in  which  to  evaluate  the 
changes  in  processing  obtained  in  the  present  study  as  a  function  of 
practice,  task  structure  and  dual  task  demands.  The  first  phase  of  the 
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model  is  characterized  by  the  effect  of  memory  and  visual  display  load  on 
performance  in  the  search  task.  Controlled  processing  is  proposed  to 
account  for  these  effects.  Since  VM  search  tasks  do  not  have  a  consistent 
relation  between  stimuli  and  responses  their  performance  is  predicted  to 
remain  at  this  level  even  with  substantial  training.  Performance  in  the  VM 
condition  in  the  current  study  is  consistent  with  the  characteristics  of 
phase  1.  Even  after  20,000  trials  of  practice,  memory  comparison  and  display 
search  performance  had  not  improved  in  the  VM  conditions.  RT  and  P300 
latency  increased  with  memory  set  size  and  the  display  position  of  the 
target . 

In  phase  two,  controlled  and  automatic  processing  are  proposed  to 
co-occur  with  performance  being  a  mixture  of  the  two  processing  modes.  It  is 
argued  that  this  phase  occurs  shortly  after  the  introduction  of  consistent 
practice  and  is  characterized  by  a  flattening  of  the  memory  load  function 
for  the  higher  set  sizes  and  the  occurrence  of  the  "p°P  out"  effect  in 
visual  search.  The  reduction  in  the  memory  set  slope  at  the  larger  set 
sizes  is  accounted  for  by  postulating  that  the  relatively  weak  automatic 
processing  finishes  before  the  controlled  processing  when  many  controlled 
processing  comparisons  must  be  made.  It  is  suggested  that  the  "pop  out" 
effect  or  parallel  processing  of  the  visual  array  is  due  to  the  tuning  of  a 
perceptual  filter  for  the  consistently  mapped  targets  (see  also  Hoffman, 
1979;  LaBerge,  1981).  Evidence  for  both  of  these  performance 
characteristics  have  been  obtained  early  in  training  in  the  present  study. 

It  is  interesting  to  note,  however,  that  the  decreased  memory  load  slope  for 
the  higher  set  sizes  was  obtained  for  P300  latency  but  not  RT  during  the 
first  1440  trials  of  practice.  An  analysis  of  data  collected  subsequent  to 
the  first  session  of  training  indicated  that  RT  displayed  the  same  change  in 
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slope  after  approximately  3,000  trials.  This  pattern  of  results  suggest  that 
the  components  of  stimulus  evaluation  processing  influenced  by  the  memory 
load  manipulation  become  automated  at  a  faster  rate  than  the  components  of 
response  related  processing  that  are  affected  by  memory  load.  Thus,  P300 
latency  may  serve  as  an  early  marker  of  the  automation  of  stimulus 
evaluation  processing. 

In  addition  to  obtaining  the  reduced  memory  set  slope  in  the  CM 
conditions  early  in  training,  we  also  obtained  evidence  that  is  consistent 
with  the  previously  reported  "pop  out"  effect  in  visual  search  (Logan,  1985; 
Shiffrin  and  Dumais,  1981).  In  our  display,  targets  could  appear  in  either 
the  right  or  left  side  of  the  frame.  In  the  VM  conditions  subjects'  RT  and 
P300  latency  was  significantly  longer  when  the  target  occurred  on  the  right 
than  when  it  was  presented  on  the  left  side  of  the  frame  (34  and  29  msec 
differences  for  RT  and  P300,  respectively).  However,  even  in  the  first 
session  RT  and  P300  latency  did  not  differ  as  a  function  of  whether  the 
target  occurred  on  the  left  or  the  right  in  the  CM  conditions.  This  pattern 
of  results  was  uninfluenced  by  dual  task  demands  or  additional  practice. 
Thus,  consistent  with  Schneider's  (1985)  model  it  appears  that  rapid  and 
apparently  parallel  display  processing  develops  early  in  training  in  CM 
conditions. 

With  additional  consistent  practice  performance  is  predicted  to 
transition  from  phase  2  to  phase  3.  The  third  phase  is  characterized  by  a 
lack  of  effect  of  memory  load  on  performance  (i.e.  zero  slope).  It  is 
asserted  that  the  sequential  operations  of  controlled  processing  are  no 
longer  necessary.  However,  subjects  still  allocate  attention  to  perform 
that  task.  The  zero  slopes  obtained  for  RT  and  P300  latency  in  session  ten 
are  consistent  with  the  transitions  predicted  by  Schneider's  model. 
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Although  we  did  not  record  ERPs  in  sessions  2  through  9,  it  would  appear, 
based  upon  the  reduced  P300  latency  slopes  for  the  larger  set  sizes  in 
session  1 ,  that  the  P300  latency  slope  may  have  attained  the  zero  slope 
criterion  prior  to  RT.  Further  research  will  be  required  to  test  this 
hypothesis. 

In  addition  to  attaining  the  zero  slope  criterion  after  substantial 
consistently  mapped  practice,  we  also  found  another  interesting  effect  not 
predicted  by  Schneider's  model.  RT/P300  ratios  were  significantly  smaller 
in  session  10  than  they  were  in  session  1,  suggesting  that  subjects'  emitted 
fast  and  accurate  responses  with  less  evaluation  of  the  stimulus  array  after 
substantial  practice  on  the  search  task  (see  also  van  Dellen  et  al. ,  1985). 
It  is  interesting  to  note  that  although  this  reduction  in  RT/P300  ratio  was 
larger  in  the  CM  condition,  the  reduction  was  also  significant  in  the  VM 
condition.  Therefore,  it  appears  that  subjects  become  more  efficient  at 
extracting  the  relevant  information  from  a  visual  display  with  practice.  In 
some  sense  it  is  not  surprising  that  this  process  improves  for  both  CM  and 
VM  search  tasks.  The  VM  task  is  not  inconsistent  in  all  of  its  components, 
only  stimulus-response  mapping  changes  over  trials.  Thus,  even  in  the  VM 
task  a  limited  set  of  items  occurs  in  fixed  positions  on  the  display.  It 
appears  that  this  level  of  consistency  is  sufficient  to  enhance  the  process 
of  information  extraction,  although  additional  consistency  in  the  form  of 
stimulus-response  mapping  results  in  added  perceptual  efficiency. 

It  is  important  to  note  that  this  improved  efficiency  in  information 
extraction  appears  to  differ  in  several  respects  from  the  parallel  display 
processing  manifested  in  session  1.  First,  the  RT/P300  ratio  effect  was 
obtained  in  both  CM  and  VM  conditions  while  the  parallel  processing  effect 
was  obtained  in  the  CM  but  not  the  VM  conditions.  Second,  the  parallel 
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processing  effect  occurred  early  in  practice  while  the  changes  in  RT/P300 
ratio  were  coincident  with  the  zero  memory  set  slope.  Finally,  the  RT/P300 
ratio  was  influenced  by  dual  task  demands  while  the  parallel  processing 
effect  was  insensitive  to  the  introduction  of  the  tracking  effect  as  well  as 
an  increase  in  tracking  difficulty.  This  pattern  of  results  suggests  that 
the  improved  processing  efficiency  manifested  by  the  decreased  RT/P300  ratio 
develops  gradually  and  remains  a  limited-capacity  process,  at  least  within 
the  level  of  training  received  in  the  present  study.  Further  research  will 
be  necessary  to  determine  whether  this  process  becomes  less  sensitive  to 
dual  task  demands  with  additional  training. 

Phase  4  processing  occurs  when  automatic  processing  has  developed 
sufficiently  such  that  neither  attention  nor  controlled  processing  is 
necessary.  This  phase  is  characterized  by  a  resistance  to  dual  task  demands 
(i.e.  perfect  time  sharing  criterion).  It  has  been  asserted  that, 
"Consistently  mapped  training  appears  to  be  a  necessary  condition  for 
improvements  in  the  subjects'  ability  to  timeshare  tasks"  (Schneider  and 
Fisk,  1982;  p.  161).  However,  although  consistently  mapped  training  may  be 
a  necessary  condition  for  improved  timesharing  it  is  not  sufficient.  For 
example,  Schneider  (1985)  has  qualified  the  conditions  under  which 
transitions  occur  from  phase  3  to  phase  4  processing  by  stating  that,  "Phase 
4  processing  may  not  operate  effectively  if  stimuli  are  severely  degraded" 
(p.  489). 

In  the  present  study  we  have  extended  the  line  of  research  which 
suggests  that  automatic  processing  develops  for  task  components  rather  than 
tasks  (Jonides  et  al.,  1985;  Logan,  1985)  by  showing  that  even  after  20,000 
trials  of  practice  with  two  supra-threshold  tasks,  dual  task  performance 
decrements  in  the  form  of  increased  memory  set  intercepts  were  the  same  for 
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VM  and  CM  conditions.  The  dual  task  interactions  were  predicted  on  the 
basis  of  previous  data  that  have  suggested  that  the  processing  demands  of 
the  tracking  task  and  the  memory  comparison  process  do  not  overlap  (Wickens 
et  al.,  1980).  Thus,  the  spare  processing  capacity  liberated  by  the 
automation  of  the  comparison  process  would  not  be  predicted  to  be  of  benefit 
to  the  processing  performed  in  the  tracking  task  (Wickens,  1980).  It  is 
interesting  to  note  that  we  found  perfect  timesharing  performance  of  the  CM 
but  not  the  VM  search  task  with  a  recognition  running  memory  task  after  the 
same  amount  of  practice  that  subjects  received  in  the  present  study  (Kramer 
and  Strayer,  1987b).  Therefore,  it  cannot  be  argued  that  subjects  were 
given  insufficient  training  to  achieve  phase  4  processing  in  this  study. 
Instead,  this  pattern  of  results  would  seem  to  argue  for  the  inclusion  of  an 
additional  qualification  on  the  transition  from  phase  3  or  capacity-limited 
to  phase  4  or  capacity-free  processing.  This  transition  appears  to  be 
limited  (a)  to  task  combinations  in  which  the  capacity  liberated  by  the 
automatic  processing  may  be  used  by  the  concurrent  task  —  the  Multiple 
Resource  view  of  processing  resources  and  (b)  the  task  stimuli  must  be 
presented  at  supra-threshold  levels  (Hoffman,  Nelson,  and  Houck,  1983; 
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Footnotes 

1  All  post-hoc  comparisons  reported  in  this  article  are  computed  with  the 
Bonferroni  t-test  and  are  significant  at  p<.05. 

2  For  the  purposes  of  the  present  study,  we  defined  the  intercept  as  the 
average  RT  obtained  in  the  memory  set  size  two  condition.  This  was  necessary 
because  of  the  flat  slope  in  the  practiced  CM  condition. 
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Table  1 

Means  (msec)  and  standard  deviations  (SD)  for  the  three  different  response 
types  in  both  CM  and  VM  conditions.  Note:  NT  is  negative  trial,  PL  is  positive 
left  trial,  and  PR  is  positive  right  trial.  P30Q's  have  been  recorded  at  Pz. 

CM  VM 


NT 

PL 

PR 

NT 

PL 

PR 

RT 

577 

519 

515 

671 

584 

618 

SD 

67 

51 

40 

71 

50 

61 

P300  Latency 

605 

547 

551 

620 

570 

599 

SD 

67 

47 

53 

64 

54 

47 
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Table  2 

Mean  error  rates  for  all  of  the  dual  task  Sternberg  conditions.  Note:  SI  is 
session  l,  S2  is  session  10. 

CM  VM 


Memory 

Set  Size 

Dual 

First 

Dual 

Second 

Dual 

First 

Dual 

Second 

SI 

S2 

SI 

S2 

SI 

S2 

SI 

S2 

Targets : 

Set  Size  2 

l  .0 

.6 

1.2 

1.0 

1.9 

1.6 

2.6 

1.7 

Set  Size  3 

1.4 

.7 

1 .4 

1.2 

2.4 

2.3 

2.5 

2.3 

Set  Size  4 

1.4 

.4 

1.5 

.8 

3.6 

2.2 

3.7 

2.3 

Non-targets : 

Set  Size  2 

.9 

.7 

2.2 

1.0 

1.1 

.8 

4.7 

1.8 

Set  Size  3 

1.6 

.7 

2.2 

1.1 

3.4 

1.5 

5.0 

2.1 

Set  Size  4 

2.0 

1 .4 

3.1 

1.2 

6.1 

1.4 

6.2 

2.5 
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Table  3 

Mean  P300  latencies  at  Pz  (msec)  for  all  of  the  dual  task  Sternberg  conditions. 
Note:  SI  is  session  L,  S2  is  session  10. 

CM  VM 


Memory 

Set  Size 

Dual 

First 

Dua  1 

Second 

Dual 

First 

Dual 

Second 

SI 

S2 

SI 

S2 

SI 

S2 

SI 

S2 

Targets : 

Set  Size  2 

555 

553 

559 

54  7 

586 

605 

587 

585 

Set  Size  3 

597 

555 

600 

558 

606 

621 

605 

610 

Set  Size  A 

604 

564 

605 

554 

639 

645 

627 

630 

Non-targets : 

Set  Size  2 

618 

615 

642 

610 

624 

640 

606 

636 

Set  Size  3 

636 

627 

644 

626 

630 

666 

620 

665 

Set  Size  4 

64) 

627 

637 

629 

638 

681 

648 

673 
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Table  4 

Mean  RT/PJOO  ratios  for  all  of  the  dual  task  Sternberg  conditions.  Note:  SI  is 
session  l,  S2  is  session  10. 

CM  VM 


Memory 

Set  Size 

Dual 

First 

Dual 

Second 

Dual 

First 

Dual 

Second 

SI 

S2 

SI 

S2 

SI 

S2 

SI 

S2 

Targ  ets : 

Set  Size  2 

1 .04 

.88 

1.06 

.93 

1.04 

.89 

1.14 

.98 

Set  Size  3 

.99 

.85 

1.08 

.94 

1.12 

.92 

1.18 

1.00 

Set  Size  4 

1.12 

.91 

1.09 

.96 

1.17 

.99 

1.22 

1.05 

Non-targets : 

Set  Size  2 

1  .04 

.87 

1.13 

.90 

1.12 

.92 

1.23 

1.01 

Set  Size  3 

1.08 

.88 

1.18 

.93 

1.25 

.96 

1.29 

1.05 

Set  Size  4 

1.25 

.88 

1.24 

.94 

1.40 

1.11 

1.36 

1.08 
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Figure  Capt ions 


Figure  J_.  A  graphic  representation  of  the  task  conf iguration.  The  subjects 
controlled  the  cursor  in  the  tracking  task  with  the  right  hand  and  responded  to 
the  Sternberg  probes  with  the  left  hand.  Sternberg  probes  were  presented  in 
the  rectangular  target.  Memory  sets  were  displayed  in  the  center  of  the 
display,  directly  above  the  tracking  task. 

Figure  2.  The  temporal  structure  of  the  tracking  and  Sternberg  tasks  both 
separately  and  together. 

Figure  3.  Average  single  task  RTs  for  each  of  the  experimental  conditions  in 
sessions  1  and  10. 

F igure  Average  single  task  error  rates  for  each  of  the  experimental 

conditions  in  sessions  1  and  10. 

F igure  _5.  Grand  average  ERPs  recorded  at  the  parietal  site  for  all  of  the 
single-task  Sternberg  conditions  in  sessions  1  and  10. 

F igure  b_.  Average  single  task  P300  latencies  recorded  at  Pz  for  each  of  the 
experimental  conditions  in  sessions  1  and  10. 

F igure  _7 .  Average  single  task  RT/P300  ratios  for  each  of  the  experimental 
conditions  in  sessions  1  and  10. 

F igure  _8.  Average  Sternberg  RTs  for  each  of  the  dual  first  order  experimental 
conditions  in  sessions  1  and  10. 

F igure  _9 .  Average  Sternberg  RTs  for  each  of  the  dual  second  order  experimental 
conditions  in  sessions  1  and  10. 

F igure  10.  Grand  average  ERPs  recorded  at  the  parietal  site  for  all  of  the 
dual-task  first  order  Sternberg  conditions  in  sessions  1  and  10. 

F igure  1 1 .  Grand  average  ERPs  recorded  at  the  parietal  site  for  all  of  the 
dual-task  second  order  Sternberg  conditions  in  sessions  l  and  10. 
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cause  the  correct  resprinse  has  been  prepared.  6.  RTs  arc  comparable  among  the  young  and  the  old  in  these  reasonably 

Squires,  Wickens,  Squires,  and  Donchin  (1976)  were  the  first  to  apply  this  uncomplicated  tasks.  This  last  finding  is  notable,  since  studies  in  which  RT 

analysis  to  ERP  data  An  important  clement  of  this  study  was  its  demonstration  is  the  only  dependent  measure  of  processing  speed  have  consistently 

of  the  crucial  role  that  subjective  expectancy  (rather  than  objective  probability  reported  a  slowing  of  response  speed  with  age  at  all  levels  of  processing 
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GENERALIZED  IMPLEMENTATION  OF  AN  EYE  MOVEMENT  CORRECTION  PROCEDURE 
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Title 

Generalized  Implementation  of  An  Eye  Movement  Correction  Procedure 
Authors 

Gregory  A.  Miller,  GaDriele  Gratton,  and  Cindy  M.  Yee,  University  of 
Illinois  at  Urbana-Champaign 
Purpose 

Removal  of  EOG  artifact  from  EEG  data  off-line,  distinguishing  blink 
and  saccade  artifact. 

Description 

The  eye  movement  correction  procedure  algorithm  proposed  by  Gratton  and 
colleagues  (Gratton,  G.,  Coles,  &  Donchin,  E.,  A  new  method  for  off¬ 

line  removal  of  ocular  artifact,  Electroencephalography  and  clinical 
neurophysiology,  1983,  ^5,  468-484}  has  been  implemented,  with  extensions, 
in  standard  DEC  Fortran  IV.  Extensions  are  principally  (a)  new 
computational  efficiences  and  (b)  suitability  for  use  with  both  vertical  and 
horizontal  EOG  channels  rather  than  vertical  EOG  alone.  Single-trial  data 
are  screened  for  blinks  and  bad  data  (the  latter  defined  as  either  a  value 
off-channel  or  an  extended  run  of  identical  values,  which  can  occur  if  a 
baseline  has  already  been  removed  from  off-channel  data).  Raw  averages  are 
computed  for  each  channel  and  experimental  condition,  separately  for  blink 
and  blink-free  data  points.  These  averages  are  subtracted  from  the 
single-trial  data.  Correlations  are  then  computed  between  the  remainder  for 
each  EEG  channel  and  the  remainder  for  the  EOG  channel,  separately  for  blink 
and  blink-free  data  points,  producing  blink  and  saccade  correction  factors 
for  each  EEG  channel.  These  correction  factors  are  then  applied  to  the 
original  raw  averages.  Optionally,  in  an  extra  pass  on  the  input  file, 
single  trials  are  also  corrected,  using  the  same  correction  factors. 
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Key  Methodological  Issues 

(1)  Artifact  estimates  are  derived  from  the  data  the  user  wishes  to 
correct;  not  solely  from  pre-stimulus  or  baseline  data,  from  instructed  eye 
movements,  or  from  other  special  recording  epochs.  (2)  Placement  of  EOG 
electrodes  must  De  considered  carefully,  with  attention  to  the  specific  EEG 
channels  to  De  corrected.  We  have  used  the  algorithm  only  with  Dipolar, 
ci rcum-ocul ar  channels,  and  we  recommend  strict  vertical  and  horizontal  EOG 
recordings.  A  monopolar  EOG  referred  to  the  mastoid,  for  example,  would  be 
vulnerable  to  EEG  contamination.  A  bipolar  EOG  placement  at  45  degrees  from 
the  vertical  would  confound  vertical  and  horizontal  eye  movements.  (3) 
Artifact  estimates  and  corrections  are  performed  separately  for  blinks  and 
saccades.  All  data  points  not  determined  to  coincide  with  blinks  are  used 
in  computing  the  saccade  correction  factor  for  each  channel.  (4)  Correction 
factors  are  computed  from  all  single-trial  data  following  removal  of  average 
data  for  a  given  trial's  experimental  condition.  That  is,  the  average  ERP 
for  all  trials  in  a  condition  is  subtracted  from  each  of  the  single  trials 
in  that  condition,  for  each  EOG  and  EEG  channel.  The  user  decides  what  the 
set  of  "conditions"  is.  In  general,  each  within-subject  cell  of  the 
experimental  design  expected  to  have  a  different  average  waveform  should  be 
declared  as  a  separate  condition.  After  subrection  of  the  relevant  average, 
all  trials  contribute  to  the  computation  of  a  single  blink  and  a  single 
saccade  correction  factor  for  each  channel.  (5)  Corrections  based  on 
multiple  channels  of  EOG  are  handled  in  multiple  runs  of  the  program.  Thus, 
if  non-midline  EEG  data  are  recorded,  and  both  vertical  and  horizontal  EOG 
are  available,  the  user  would  set  up  the  first  run  of  the  program  to  treat 
the  horizontal  channel  identically  with  the  EEG  channels,  removing  vertical 
EOG  artifact  (blinks  and  the  verti cal -axi s  component  of  saccades).  In  a 


second  run,  the  residual  in  the  EEG  channels  would  be  corrected  for 
correlations  with  the  residual  in  the  horizontal  EOG  channel 
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(horizontal-axis  saccades). 

Hardware  Character!' sties 

The  program  was  developed  on  a  DEC  PDP-11/73  using  single-precision 
(32-bit)  real  arithmetic  and  arrays.  Space  needs  for  data  arrays  can  he 
large  for  some  combinations  of  number  of  channels,  points  per  channel,  and 
number  of  experimental  conditions  (number  of  trials  has  little  impact  on 
array  usage).  One  version  of  the  program  (7  channels,  100  points,  4 
conditions,  4000  trials)  requires  108KB.  Another  version  (4  channels,  125 
points,  11  conditions,  1500  trials)  requires  148KB.  For  DEC  f'DP-11 
computers,  sufficient  memory  available  for  virtual  arrays  is  necessary  in 
most  cases. 

Software  Characteristics 

The  eye  movement  correction  program  was  originally  written  in  Harris 
Fortran  F66,  a  proprietary  structured  language.  The  program  has  been 
re-written  in  DEC  Fortran  IV  (compiler  version  2.6),  running  under  RT-11  or 
TSX-Plus.  It  is  highly  compatible  with  other  Fortran  iv  and  Fortran-77 
compilers,  as  efforts  were  made  in  coding  to  facilitate  porting  to  other 
compilers. 

1/0  Characteri sties 

Standard  DEC  RT-11  Fortran  files  are  read  and  written.  A  user-edited 
specification  file  directs  program  execution,  such  as  whether  blink 
detection  and  correction  is  to  be  done  (e.g.,  it  would  not  be  if  correction 
is  based  on  a  horizontal  EOG  channel),  and  the  value  beyond  which  data  are 
to  be  considered  off-channel. 

Performance  Notes 
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Runtime  depenas  on  the  product  of  number  of  trials,  channels,  and 
points.  Execution  time  for  4000  trials,  7  channels,  100  points,  and  4 
conditions  in  unformatted  binary  files  on  moderately  fast  hard  disk,  is  110 
minutes  for  average-ERP  correction  and  optionally  another  60  minutes  for 
single-trial  correction.  For  4  channels,  125  points,  and  11  conditions, 
execution  time  for  1500  trials  is  30  minutes  for  average-ERP  correction  and 
optionally  another  30  minutes  for  single-trial  correction. 

A1 gori thm  Porting  Considerations 

(1)  Each  user  should  experiment  with  optimal  settings  for  the  blink 
detection  criterion,  depending  on  local  amplification.  (2)  The  correction 
works  poorly  for  EEG  channels  positioned  very  close  to  the  E0G  site,  due  to 
the  EEG  signal  appearing  in  the  E0G  channel.  Thus,  a  small  amount  of  true 
EEG  common  to  both  Fpz  EEG  and  supra-orbi tal  EOG  recordings  will  lead  the 
correction  algorithm  to  remove  that  portion  of  the  EEG  from  the  Fpz  channel. 
Source  Code  Porting  Considerations 

(1)  Array  space  needs  can  be  large.  The  program  runs  without 
modification  on  a  PDP-11/23  with  22-bit  addressing  (run-time  is 
approximately  doubled).  The  program  has  been  ported  successfully  to  a  256KB 
PDP-11/34  and  to  a  256KB  POP- 1 1/ 73  at  other  universities.  (2)  File  1/0 
format  must  be  customized  to  the  user's  data  files.  The  program  currently 
uses  unformatted  binary  files  for  speed  and  space  reasons  and  utilizes  trial 
identification  intormation  stored  within  the  record  for  each  channel. 

Program  Avai 1 abi 1 ity 

Source  code  listing;  copy  on  user-supplied  DEC  RT -11  9-track  magtape  at 
800  or  1600  BPI;  TU58  tape;  RK05,  RL01 ,  or  RL02  disk;  dial-up  modem;  or 
BITNET.  Sample  data  and  output  are  available. 


Documentation 
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Source  code  is  extensively  commented.  A  user's  manual  is  available. 
Support  Available 

The  authors  are  available  for  consultation. 
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Article  1 


EPIC 

Applications  of  ERPs  to  Human  Factors 
Christopher  D.  Wickens 

In  his  paper.  Dr.  Parasuraman  has  eloquently  made  the  case  for  the 
modest,  yet  significant,  contribution  that  the  Event  Related  Brain  Potential 
(ERP)  may  make  to  the  discipline  of  Human  Factors,  and  he  has  as  well 
provided  a  nice  overview  of  many  of  the  most  promising  applications.  I 
applaud  both  his  call  for  careful  theory-based  applications,  as  well  as  his 
caution  for  exercising  restraint  when  describing  the  real  contributions  that 
the  measures  can  make.  The  objectives  of  my  presentation  here  will  be 
twofold:  (1)  to  outline  what,  from  my  perspective,  are  the  necessary 
criteria  for  complete  and-successful  applications  of  ERPs  to  human  factors 
systems  design  and  evaluation,  and  in  what  way  these  criteria  have  been  or 
can  be  met.  (This  outline  will,  I  believe,  reveal  some  of  the  reasons  why 
applications  are  difficult  to  achieve  and,  therefore,  why  restraint  in 
promises  is  necessary),  (2)  to  review  Dr.  Parasuraman' s  list  of  candidate 
applications  and  to  expand  upon  this  list  slightly  by  adding  some  of  my  own. 

It  should  be  noted  in  opening  that  the  sequence  of  steps  from  the 
identification  of  an  interesting  ERP  phenomenon  to  the  successful 
applications  in  system  design  is  a  long  one  that  will  not  be  easily 
traversed.  Yet,  the  frustrations  Inherent  in  this  delay  are  not  unique  to 
the  ERP  field,  as  the  time  lag  for  applications  of  basic  research  In 
engineering  and  cognitive  psychology  is  sometimes  equally  long  and  tortuous 
(Adams,  1972;  Rouse,  1985).  Many  lessons  learned  from  the  latter  are 
equally  relevant  to  the  former,  and  will  be  elaborated  below.  Yet, 
electrophyslological  signals  have  some  unique  characteristics  not  shared  by 
the  more  conventional  manual  and  vocal  performance  measures,  that  dictate 


special  concerns  in  the  course  of  application.  Before  this  course  and  the 
relevant  concerns  are  outlined  however,  1  shall  briefly  sunmarlze  what  I  see 
to  be  the  three  most  relevant,  and  generic  human  engineering  concerns,  for 
which  ERPs  may  provide  general  answers.  These  are: 

(1)  How  to  choose  which  Is  the  "best"  system  (or  system  component). 
Such  a  choice  may  Involve  a  simple  comparison  of  performance.  But 
if  performance  Is  equal  between  two  systems,  or  is  difficult  to 
measure  the  choice  may  require  a  comparison  of  the  cost  of 
performance  in  terms  of  mental  workload.  And  if  the  system  of 
greater  workload  is  more  expensive  in  dollars  to  manufacture  and 
operate,  then  the  choice  requires  a  quantitative  scale  of  workload 
that  may  be  balanced  against  dollars. 

(2)  How  to  predict  human  error.  Will  models  of  human  performance  be 
accurate  enough  to  predict  when  a  human  operator,  interacting  with 
a  particular  system,  may  produce  an  error  with  major  consequences? 
Can  errors  of  perception,  of  attention,  of  memory,  and  of  response 
be  predicted  and  discriminated? 

(3)  How  to  assess  the  level  of  skill  acquired  by  an  operator  through 
training.  If  performance  reaches  asymptote,  will  the  workload 
continue  to  be  reduced?  At  what  point  has  training  clearly  become 
automated? 

These  then  are  the  objectives  toward  which  ERP-based  applications  should  be 
focused. 


Criteria  for  Application 


In  order  to  apply  ERPs  to  answer  the  questions  outlined  above,  the 
cognitive  psychophysiologist  must  first  address  the  question  that  will 


undoubtedly  be  asked  by  the  HF  practitioner:  "Why  not  use  more  conventional 
measures  of  performance  —  either  manual  responses  or  vocal  reports  —  to 
address  the  same  human  engineering  issues?"  The  answer  to  this  question  is 
that  ERPs  can  provide  information  that  is  either  obtained  at  a  greater  cost 
by  using  performance,  or  cannot  be  obtained  at  all  by  performance  measures. 
(Hereafter  the  term  "performance"  will  be  used  to  refer  to  these 
conventional  behavioral  measures.  In  contrast  to  the  event-related  potential 
measures.)  The  first  issue  —  that  of  relative  cost  will  be  deferred  until 
later.  However,  the  second  issue  will  be  discussed  extensively  below, 
because  it  is  critically  important  to  the  issue  of  applications. 

Dissociation  and  Association 

The  idea  that  ERPs  can  provide  information  unavailable  in  performance 
invokes  the  concept  of  dissociation,  depicted  in  Figure  1.  Here  two  forms 
of  variance  —  in  physiological  measures  and  in  performance  —  are  depicted. 
These  two  variances  define  three  regions.  In  the  overlapping  region  (1), 
performance  and  ERPs  covary.  In  the  region  to  the  left  (Region  2),  ERPs 
show  unique  variance  that  is  not  revealed  in  performance.  This  variance 
provides  Information  to  the  extent  that  it  correlates  with  some 
experimenter-defined  manipulation,  or  subject-reported  strategy.  In  the 
region  to  the  right  (Region  3),  ERPs  fail  to  reflect  changes  shown  by 
performance.  Regions  2  and  3  then  define  dissociation.  The  existence  of 
Region  2  and,  less  obviously.  Region  3  both  provide  answers  to  the  question 
"Why  not  performance?"  But,  In  order  to  satisfactorily  answer  this 
question,  it  will  be  necessary  for  the  cognitive  psychophysiologist,  working 
in  consort  with  the  Human  Factor  practitioner,  to  traverse  through  the 
sequence  of  logical  steps  shown  in  Figure  2. 


The  sequence  is  entered  at  point  1,  where  an  important  source  of 
dissociation  is  identified.  For  example,  ERPs  may  serve  as  a  measure  of 
auditory  attention  allocation,  that  is  impossible  to  observe  through 
performance  (e.g.,  Hansen  and  Hlllyard,  1980).  The  question  of  which 
messages  may  be  attended  and  which  Ignored  can  be  a  critical  one  in  an 
operational  environment  such  as  the  aircraft  cockpit.  At  this  point,  the 
system  designer  may  rightfully  ask  if  the  dissociation,  rather  than 
reflecting  meaningful  unique  variance  (region  2  or  figure  1),  or  an 
interpretable  lack  of  effect  (region  3),  is  in  fact  either  noise  or 
insensitivity  (a  ceiling  or  floor  effect).  To  convince  the  practitioner 
that  the  ERP  measure  is  meaningful  thus  requires  that  the  psychophysiologist 
take  a  step  back,  to  point  2  of  figure  2,  and  demonstrate  an  association. 
That  is,  provide  an  experiment  in  which  the  ERP  correlates  directly  with 
performance  as  In  Region  1  of  Figure  1.  Examples  of  such  studies  are  those 
in  which  P300  is  correlated  with  manipulations  of  difficulty,  which  also 
affects  performance  (e.g.,  Isreal,  Wickens,  Chesney,  &  Donchin,  1980). 

These  validation  studies  of  course  contribute  nothing  directly  to  the  value 
of  the  ERP  measures  as  a  system  design  tool,  except  to  persuade  the  HF 
practitioner  of  the  meaningful  ness  and  trustworthiness  of  the  measure,  so 
that  a  logical  case  can  be  established  as  to  why  the  measure  dissociates  In 
some  situations  (Kahneman,  1982  SPR  Address). 

Once  the  HF  practitioner  has  been  convinced  of  the  validity  of  the 
measure,  then  potentially  valuable  information  may  be  gained  from  either  of 
the  two  types  of  dissociation.  An  example  of  Type  3  dissociation  was 
provided  by  an  experiment  of  Isreal,  Chesney,  Wickens,  and  Donchin  (1980), 
in  which  It  was  observed  that  the  ERP  amplitude  of  the  P300  component  did 
not  change,  even  as  the  demand  of  a  tracking  task  was  varied  by  changing  Its 


bandwidth.  Such  constancy  of  the  ERP  was  interpreted  in  light  of  a  process 
model  of  P300.  This  model  posited  that  the  amplitude  depended  only  upon  the 
availability  of  perceptual /cognitive  resources,  which  were  undepleted  by  the 
response-load  manipulation  of  tracking  task.  As  a  consequence,  the  unique 
diagnosticity  of  the  ERP  to  reveal  perceptual/cognitive  load  was 
demonstrated.  A  similar  demonstration  was  provided  by  Kramer,  Wickens,  and 
Donchin  (1983),  who  attributed  the  lack  of  reduction  of  secondary  task  P300 
amplitude  with  practice  on  a  primary  target  acquisition  task,  to  the 
development  of  response-related  strategies.  In  a  more  general  sense.  Type  3 
dissociations  of  P300  and  performance  can  be  reasonably  and  measurably 
interpreted  whenever  the  manipulations  are  logically  those  that  Involve 
either  the  resource  demands  (Kramer,  et  al.,  1983;  Isreal,  Chesney,  et  al., 
1980),  or  the  latency  (McCarthy  &  Donchin,  1981)  of  response-related 
processes  (see  Figure  3). 

The  importance  of  Type  2  dissociations  are  easier  to  justify  to  the 
human  factors  practitioner.  This  is  because  of  the  plethora  of  perceptual 
and  attentional  phenomena  directly  indexed  by  ERP  components,  which  are  not 
revealed  by  performance  measures.  These  dissociations,  in  fact,  increase  in 
their  importance  in  human  factors  to  the  extent  that  automated  systems 
evolve  which  demand  continuous  monitoring  and  understanding  with  only 
minimal  requirements  for  the  operator  to  Intervene  with  overt  responses 
(Hart  &  Sheridan,  1984).  Therefore,  under  such  conditions  (and  their 
laboratory  analogs),  performance  data  are  simply  not  available.  Here  again 
however,  as  In  the  case  of  Region  2,  the  establishment  of  a  meaningful  and 
useful  dissociation  must  be  based  upon  a  theory-based  or  processed-based 
explanation  of  why  the  ERP  should  be  expected  to  vary  in  the  way  that  It 
does.  Thus  for  example,  theories  of  visual  or  auditory  selective  attention 


are  necessary  to  establish  the  conditions  under  which  attention  allocated  to 
a  visual  or  auditory  stimulus  would  be  expected  to  enhance  some  component  of 
the  ERP  (Parasuraman,  1985;  Hansen  and  Hillyard,  1980). 


"Real  World"  Complexity 

The  issue  of  how  to  successfully  demonstrate  a  dissociation  between 
performance  and  ERPs  in  a  manner  that  will  convince  the  human  factors 
practitioner  of  the  value  of  the  latter,  brings  up  a  point  addressed 
directly  in  Parasuraman' s  paper  —  the  extent  of  "real  world  complexity" 
that  it  is  necessary  to  design  Into  the  paradigm.  Here  the  views  of  the 
psychophsiologlst  (generally  reflecting  those  of  the  cognitive  and 
-  experimental  psychologist)  may  come  Into  partial  conflict  with  those  of  the 
engineering  psychologist.  The  experimental  or  cognitive  psychologist  will 
likely  feel  that  good  and  careful  experimental  control  of  a  phenomenon  in  a 
laboratory  setting  Is  not  only  necessary,  but  also  sufficient  to  establish 
the  relevance  and  validity  of  that  phenomenon  as  an  aid  to  systems  design 
questions.  I  agree  with  this  supposition  only  in  part,  as  qualified  by  the 
slightly  different  perspective  of  the  engineering  psychologist.  This 
perspective  argues  that  two  further  factors  must  be  taken  Into  account 
beyond  careful  controls,  both  of  which  make  it  Important  that  the 
dissociation  be  demonstrated  in  environments  of  greater  complexity. 

(1)  Many  Human  Factors  practitioners  whose  objective  it  is  to  convince 
will  rightly,  or  wrongly,  fall  to  be  convinced  unless  the  dissociation  Is 
demonstrated  in  an  environment  that  has  at  least  some  of  the  trappings  of 
real  world  complexity.  For  example,  according  to  this  view,  a  convincing 
demonstration  of  the  ERP  as  an  aviation  workload  measure  must  be  provided  in 
the  aircraft's  simulator,  and  not  just  in  the  tracking  laboratory.  However, 


it  is  equally  important  to  realize  that  the  increased  complexity  should  not 
be  bought  at  the  price  of  a  shoddy,  poorly  designed  study.  Increases  in 
complexity  will  not  compensate  for  deficiencies  in  experimental  protocol. 

It  Is  unfortunate  for  example  that  many  potentially  valuable  demonstrations 
have  proven  to  be  relatively  worthless  because  small  sample  sizes  have  been 
used  to  compensate  for  the  Increased  expenses  incurred  in  conducting  hese 
more  complex  studies. 

(2)  More  Importantly  from  a  fundamental  applications  standpoint,  it  is 
necessary  to  establish  that  the  variance  in  the  processing  phenomenon 
reflected  by  the  electrophysiological  measure,  is  sufficiently  large, 
relative  to  the  sources  of  uncontrolled  variance  that  exist  in  the  real 
world,  that  the  former  will  have  a  meaningful  impact  on  system  performance. 
Suppose  for  example,  it  is  established  that,  with  all  variables  tightly 
controlled  in  the  laboratory,  a  particular  ERP  component  is  sensitive  to  a 
manipulation  of  resource  allocation.  But  when  some  of  those  controls  are 
released  (as  they  must  be  outside  of  the  controlled  laboratory  setting),  and 
small  random  effects  now  perturb  the  measure,  the  variance  in  ERPs  due  to 
the  original  phenomenon!  Is  now  swamped.  In  such  an  Instance,  the  true  value 
of  the  ERP  measure  to  the  HF  practitioner  would  be  greatly  diminished.  Yet 
this  diminished  worth  would  never  have  been  revealed  until  the  tight 
controls  were  lifted. 

This  requirement  to  Incorporate  some  level  of  complexity  in 
experimental  demonstrations  of  a  dissociation  should  not  be  viewed 
pessimistically  by  the  psychophysiologist.  In  fact,  when  Incorporating  real 
world  complexity  brings  about  an  Increase  In  overall  workload,  such  an 
Increase  may  even  enhance  the  prominence  of  the  processing  phenomenon  under 


investigation  (e.g.,  resource  availability  or  the  criticality  of  resource 
allocation). 


An  example,  demonstrating  the  Importance  of  being  able  to  relax 
excessive  experimental  control,  involves  the  concern  for  control  of  eye 
fixation.  This  concern  transcends  much  of  the  experimentation  with  visual 
ERPs  and  with  non-electrophysiological  studies  of  perception  and  cognition 
as  well.  The  view  is  sometimes  expressed  that  the  investigator  must  know 
the  precise  location  of  fixation  in  order  to  draw  valid  inferences.  Vet, 
conclusions  that  are  drawn  only  from  the  eye-controlled  scan-free  laboratory 
may  be  tenuous  when  applied  to  the  wide-field  visual  world,  and  such 
conclusions  legitimately  may  be  distrusted  by  the  human  factors 
practitioner.  In  fact,  such  single-minded  concerns  with  fixations  are 
unnecessary.  For  example,  both  Wickens,  Heffley,  Kramer  and  Oonchin  (1980), 
and  Kramer,  Wickens,  and  Oonchin  (1983)  have  drawn  some  strong  conclusions 
concerning  the  allocation  of  visual  attention  to  a  complex  "free-scan" 
display,  with  many  of  the  uncontrolled  aspects  of  the  natural  world  present. 
These  conclusions  gain  in  their  validity  precisely  because  visual  fixations 
were  not  tightly  constrained. 

Design  Impact 

Given  that  the  feasibility  of  ERP  applications  has  been  demonstrated  in 
Step  3  (by  iterating  back  to  Step  2  when  necessary  to  convince  the  skeptic 
by  demonstrating  association),  the  next  step  (4  1i  figure  2)  is  taken  when 
the  ERP  measure  actually  Impacts  on  the  design  of  a  system.  Parasuraman's 
discussion  nicely  Illustrates  the  example  of  Frustrahofer' s  study  In  which 
EEG  provided  a  measure  of  arousal  that  dissociated  from  performance.  As 
another  example,  the  electrophysiological  data  using  steady  state  ERPs 
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provided  by  Gomer  and  Bish  (1978)  was  used  to  influence  the  choice  of 
formats  for  alphanumeric  displays. 

At  Step  4,  there  are  really  three  levels  of  impact  that  ERP  measures 
may  have  on  a  design  process.  First,  they  may  be  used  to  justify  a 
particular  design  decision  because  they  reinforce  behavioral  measures,  or 
design  considerations,  already  available.  Such,  for  example,  would  be  the 
case  if  an  ERP-based  workload  measure  was  used  to  influence  the  choice  of 
one  display  format  over  another,  in  agreement  with  ratings  obtained  by 
subjective  measures.  Secondly,  ERP  measures  may  provide  information 
unavailable  through  performance  —  the  Type  3  dissociation  --  although  this 
information  may  be  consistent  with  the  human  factors  practitioner' s  “gut 
beliefs."  Here,  for  example,  are  ERP  measures  which  may  show  that  a  warning  - 
light  on  a  multi-element  visual  display  is  unattended. 

The  third  level  of  Impact  is  revealed  in  Step  5,  and  is  one  that  would 
bring  greatest  satisfaction  to  the  psychophysiologist  that 
electrophysiological  measures  have  truly  "come  of  age."  This  would  describe 
a  situation  in  which  the  human  factors  practitioner  made  a  different  design 
decision  as  a  consequence  of  the  ERP  measures  than  would  have  been  made  in 
their  absence.  Such  a  course  of  action  would  mean  that  the  implication  of 
the  ERP  measure  for  design  was  pitted  against  the  implications  of  an 
alternative  design  consideration  (cost,  performance,  or  the  practitioner’s 
"gut  belief"),  and  the  ERP  measure  was  victorious. 

The  last  step  (6)  In  the  transition  to  applications,  and  the  ultimate 
reinforcement  for  the  psychophysiologist,  is  provided  when  it  is  determined 
that  a  system,  designed  with  the  aid  of  ERP  measures,  was  actually  more 
effective  than  one  designed  without.  Unfortunately,  such  success  stories 
are  extremely  hard  to  document,  because  the  "control  system-designs"  (those 
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systems  that  are  designed  by  equivalent  procedures  except  for  the  Input 
provided  by  the  ERP  measure),  are  rarely  available  to  be  compared.  It  Is 
here  an  act  of  faith  to  assume  that  the  control  system-design  would  not  have 
been  as  effective,  had  It  been  completed.  It  is  unfortunate  that  the 
absence  of  such  necessary  control  data  to  evaluate  the  effectiveness  of 
design  tools  is  a  widespread  symptom  In  human  factors,  and  hampers  the 
evaluation  of  the  usefulness  of  purely  performance-based  models  and  tools  as 
well . 

While  the  path  to  ultimate  success  for  ERP  applications  from  Step  1  to 
Step  6  appears  long  and  tortuous.  It  is  important  to  realize  that  partial 
successes  may  be  very  much  realized  at  earlier  steps  along  the  way.  For 
example,  as  shown  toward  the  bottom  of  Figure  2,  well  designed  ERP 
demonstrations  of  dissociation  can  have  an  Impact  on  the  general  theory  of 
human  performance;  ultimately,  but  with  less  direct  connections,  the  revised 
theory  may  in  turn  influence  system  design  (though  at  the  time  of  that 
Influence,  the  direct  contributions  of  the  ERP  may  not  be  recognized  or 
acknowledged).  For  example,  the  investigations  carried  out  on  the  ERP  as  a 
workload  measure  by  Isreal  et  al.  (1980a,  1980b)  have  had  a  direct  input 
toward  establishing  the  stage-of-processing  dimension  as  an  Important 
component  of  the  multiple  resource  model  of  human  time-sharing  performance. 
This  model  Itself,  Is  now  becoming  a  useful  tool  in  system  design  (North, 
1985). 

Applications 

Parasuraman  has  nicely  described  applications  In  the  areas  of  workload 
assessment  and  resource  allocation.  I  would  like  now  to  add  some  further 
comments  to  his  remarks  on  the  applications  of  ERPs  to  these  areas  of 
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assessment,  and  also  to  identify  one  additional  area  to  which  I  believe  ERP 
measures  have  the  potential  to  contribute  substantially. 

Workload 

First,  I  would  like  to  address  the  concern  that  the  P300  as  a  secondary 
task  workload  measure  may  be  more  obtrusive  into  primary  task  performance 
than  is  sometimes  claimed  by  proponents  of  the  measure.  My  answer  is  that 
the  measure  is  Indeed  somewhat  obtrusive,  and  cannot  be  obtained  without 
imposing  some  extra  cost  on  the  operator's  performance  (having  to  count,  or 
keep  track  of  tones).  Yet  after  working  in  the  area  of  mental  workload  for 
over  10  years,  I  have  become  increasingly  convinced  that  there  is  "no  free 
lunch"  in  workload  measures.  Those  successful  measures  that  are  less 
obtrusive  into  primary  task  performance  (e.g.,  heart  rate  variability, 

Mulder  &  Mulder,  1981;  pupil  diameter,  Beatty,  1982),  by  the  very  aspect  of 
being  less  obtrusive  on  performance  and  as  a  result,  a  less  direct  measure 
of  cognitive  activity,  are  more  susceptible  to  other  influences,  and 
therefore  will  be  less  reliable.  Similar  concerns  arise  regarding  the 
applications  of  unobtrusive  ERP  measures  of  workload  derived  from  the  steady 
state  ERP  (Junker,  Kenner,  &  Casey,  1985).  The  fact  that  these  measures  may 
be  obtained  by  driving  the  visual  field  with  high  frequency  luminance 
functions  well  above  the  dlscrimlnable  critical  flicker  fusion  frequency,  is 
compromised  by  the  apparent  Inconsistency  with  which  parameters  of  the  SSERP 
measure  respond  to  Imposed  manipulations  of  workload  (l.e..  The  association 
shown  In  Step  2  of  Figure  2  Is  lacking). 

Most  recently,  as  noted  by  Parasuraman,  research  has  suggested  that 
ERPs  elicited  by  events  embedded  naturally  within  the  primary  task  can 
directly  Index  the  resources  allocated  to  the  primary  task,  and  thereby 
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indicate  the  workload  of  that  task  in  a  way  that  does  not  require  the 
instrusive  secondary  task  (Wickens,  et  al.,  1982;  Horst,  et  al.,  1985; 
Kramer,  Wickens  and  Donchln  1985;  Sirevaag,  et  al.,  in  press).  This  measure 
at  present  is  limited,  however,  by  the  sensitivity  of  primary  task  ERPs  to 
other  potentially  confounding  variables.  For  example,  a  large  ERP  elicited 
by  a  primary  task  event  may  result  either  because  the  task  was  difficult, 
and  more  resources  were  allocated  or  because  the  event  within  the  primary 
task  was  relatively  unexpected.  Also  of  course  some  primary  tasks  do  not 
lend  themselves  naturally  to  the  Incorporation  of  discrete  ERP-elicting 
events. 

Resource  allocation.  As  noted  above.  Increasing  levels  of  automation 
have  progressively  placed  human  operators  of  complex  systems  in  a 
supervisory  role  In  which  large  numbers  of  channels  of  visual  and  auditory 
information  must  be  monitored.  There  are  generally  few  overt  responses  made 
in  these  environments,  but  two  Important  covert  processes  are  often 
required:  (1)  monitoring,  and  then  detecting  and/or  recognizing  critical 
events;  (2)  Integrating  Information  to  update  a  mental  picture  of  the  system 
or  process  under  observation.  It  is  of  considerable  importance  for  the 
system  designer  to  attain  an  accurate  model  of  the  distribution,  breadth  or 
focusing  of  attention  under  these  circumstances.  To  obtain  such  a 
representation,  three  “performance"  measures  are  presently  available  -- 
verbal  protocols,  responses  to  stimulus  events  and  oculometric  measures. 

Here  then,  as  in  Step  1  of  Figure  2,  it  Is  appropriate  to  ask  what  unique 
variance  ERPs  have  to  contribute  beyond  that  available  from  performance.  In 

short,  why  not  performance? 
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In  answer  to  this  question,  it  should  be  noted  first  that  verbal 
protocols  may  be  unreliable.  Secondly,  reaction  time  or  response  measures 
cannot  be  easily  collected,  because  to  impose  a  requirement  to  respond  to 
each  event  for  which  a  measure  of  attention  is  required,  may  drastically 
alter  the  task  characteristics  in  an  unnatural  way.  Suppose,  for  example, 
one  wished  to  infer  whether  or  not  the  onset  of  a  caution  warning  light  were 
noticed  during  the  intense  Involvement  with  another  task.  It  would  not  be 
desirable  to  instruct  the  subject  to  "press  the  button"  when  the  advisory 
appears.  This  alteration  of  the  task  would  change  its  natural 
characteristics  and  give  the  processing  of  the  caution  light  greater 
priority  than  it  might  actually  have  in  the  real  world.  It  would  be  far 
better  here  to  allow  the  ERP  components  to  index  the  degree  of  processing 
given  to  the  light  occurring  in  its  natural  context. 

Unlike  overt  manual  or  verbal  responses,  crude  measures  of  visual 
fixations  can  be  recorded  without  greatly  disrupting  the  natural  information 
processing  characteristics  of  the  task.  As  long  as  the  inference  is  made 
that  where  the  eye  is  looking  is  exactly  what  is  processed,  then  necessary 
and  sufficient  Information  will  be  available.  The  problem  is  this 
assumption  is  limited  in  four  respects:  (1)  Different  items  or  dimensions 
of  information  can  be  either  attended  or  ignored  within  a  single  fixation 
(e.g.,  Donchin  &  Cohen,  1967;  Kramer,  Wickens,  &  Donchln,  1983).  Therefore, 
fixation  can  not  discriminate  attended  for  unattended  dimensions  of  a  single 
stimulus;  nor  the  allocation  of  resources  between  two  stimuli  within  1  of 
visual  angle.  (2)  Information  may  be  processed  and  attended  outside  of  the 
fovea  (Posner,  1982).  There  Is,  in  fact,  a  substantial  literature  showing 
the  value  of  parafoveal  motion  In  tracking  performance  (Wickens,  1986).  The 
degree  to  which  this  processing  is  taking  place  however  can  never  be 
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revealed  through  fixations.  (3)  Attention  may  be  switched  between 
modalities.  It  is  apparent  that  fixations  provide  no  clue  as  to  whether 
auditory  information  ic,  or  is  not,  being  processed.  (4)  Long  fixations  are 
ambiguous.  There  may  be  three  reasons  why  fixation  on  a  particular  stimulus 
is  of  a  long  duration.  First,  a  lot  of  information  may  be  extracted  from 
that  stimulus  (it  is  surprising,  novel,  or  complex).  Second,  the  stimulus 
may  be  of  poor  data  quality  requiring  a  long  fixation  to  resolve 
perceptually.  Third,  attention  allocation  may  be  allocated  elsewhere 
producing  what  Harris  and  Spady  (1985)  have  referred  to  as  a  "blank  stare." 

All  four  of  these  sources  of  ambiguity  may,  to  some  extent,  be  resolved 
by  recording  ERPs  elicited  by  the  stimuli  in  question.  For  example,  a  large 
ERP  will  discriminate  the  long  fixation  resulting  from  high  information, 
from  that  resulting  from  poor  data  quality  or  the  "blank  stare."  Therefore, 
the  best  “map"  of  attention  to  the  perceptual  world  seems  to  be  one  that 
takes  advantage  of  information  provided  jointly  by  ERPs  and  eye  fixations. 

Information  Processing  Stages 

Studies  of  latency  and  amplitude  changes  have  dominated  much  of  the 
research  In  ERPs.  Yet,  In  many  respects,  errors  are  the  most  critical 
variable  to  be  predicted  In  operational  environments,  and  there  is  a  growing 
interest  in  the  causes  and  sources  of  human  errors  (Rouse  &  Rouse,  1983; 
Norman,  1981;  Reason,  1984).  One  Important  scheme  has  associated  different 
classes  of  errors  with  different  stages  of  information  processing.  An 
important  distinction  Is  made  between  errors  of  perception  or  "mistakes,"  in 
which  a  stimulus  event  is  inappropriately  categorized,  and  errors  of 
response  selection  or  "slips"  In  which  an  Inappropriate  i  :t1on  Is  given  to  a 
stimulus  event  that  was  correctly  categorized. 
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From  a  human  engineering  point  of  view,  the  distinction  between 
mistakes  and  slips  is  important  because  different  corrective  actions  should 
be  taken  if  one  or  the  other  form  of  error  is  prevalent.  Mistakes  result 
because  stimulus  events  are  ill-defined,  poorly  resolved  or  this 
categorization  is  not  well  learned  by  the  perceiver.  A  corrective, 
therefore,  would  be  one  that  would  improve  the  display  format  for  a 
particular  piece  of  equipment.  In  marked  contrast,  slips  occur  with  highly 
automated,  well  learned  sequences  of  behavior,  and  are  often  a  consequence 
of  a  more  frequently  given  response  "capturing"  the  stream  of  behavior  and 
dominating  the  appropriate  response  of  lesser  frequency  (Reason,  1984; 
Norman,  1981).  This  will  be  likely  to  occur  when  the  level  of  automaticity 
Is  such  that  careful .-concise  monitoring  of  the  selection  of  responses  is 
not  carried  out.  Corrective  system  redesign  to  eliminate  slips  would 
involve  restructuring  the  placement  of  controls,  and  association  of  controls 
to  stimuli  In  a  manner  designed  to  prevent  their  occurance.  The  importance 
of  ERP  measures  in  discriminating  slips  from  mistakes  lies  in  their 
insensitivity  to  response  factors  (Region  3  in  Figure  1,  and  as  shown  in 
Figure  3).  P300s  produced  by  mistake  will  be  delayed,  probably  poorly 

resolved,  and  associated  with  equally  long  (If  not  longer)  response  times. 
P300s  produced  by  slips  may  also  be  longer  than  those  associated  with 
mistakes,  but  will  also  be  coupled  with  response  times  that  are  quite  short, 
relative  to  P300  latency  (Oonchin,  et  al..  In  press). 

Sunwary 

Human  Factors  Is  a  blossoming  field,  with  a  clear  need  for  new  and 
innovative  methodologies.  This  neeed  is  enhanced  by  the  rapid  evolution  of 
computerized  systems  with  which  humans  must  interact.  Therefore  it  is  with 
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some  confidence  that  one  can  foresee  the  application  of  well-modeled,  well 
understood  techniques  based  upon  ERPs,  taking  place  in  the  future.  The  many 
steps  In  Figure  2  suggest  however  that  these  applications  may  not  occur 
within  the  Imnedlate  future.  Yet  with  the  proper  selective  focus  on 
relevant  paradigms,  derived  by  listening  to  and  interpreting  the  needs  of 
the  human  factors  practitioner,  and  addressed  with  concern  for  real  world 
complexity,  there  Is  no  reason  why  the  process  cannot  be  accelerated. 
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FIGURE  CAPTIONS 


Figure  1:  Three  Regions  of  Variance.  Region  1:  Association.  Region  2: 
Dlssassociation  (performance  Insensitivity).  Region  3: 
Dissassoclatlon  (ERP  insensitivity) 

Figure  2:  Sequence  of  steps  necessary  for  complete  and  satisfactory 

application  of  ERP  methodology  to  Human  Factors  design  Issues. 
The  text  describes  the  differenct  step  munbers. 

Figure  3:  Causes  of  region  3  dissociation.  The  figure  depict  three 

processing  stages  at  the  bottom,  only  the  first  two  of  which 
reflect  P300  latency.  The  third,  response  stage  does  not.  This 
stage  also  depends  upon  separate  processing  resources  from  those 
that  supply  P300-dependent  stages  (top  of  figure).  Therefore, 
variation  of  response  load  will  affect  RT  and  global  workload 
measures,  but  not  P300  latency  nor  P300  amplitude. 
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Abstract 

Subjects  were  instructed  to  use  either  rote  or  elaborative  strategies 
to  memorize  words  in  a  von  Restorff  paradigm.  When  instructed  to  use  rote 
strategies,  subjects  displayed  a  higher  von  Restorff  effect  and  a  lower 
recall  performance  than  when  instructed  to  use  elaborative  strategies. 
Furthermore,  the  amplitude  of  the  P300  component  of  the  event-related  brain 
potential  predicted  subsequent  recall  only  when  subjects  used  rote 
strategies.  When  subjects  used  elaborative  strategies,  the  relationship 
between  P300  amplitude  and  subsequent  recall  was  not  observed. 

These  results  confirm  and  expand,  in  a  within  subjects  design,  the 
results  reported  by  Karis,  Fabiani,  and  Donchin  (1984)  who  capitalized  on 
different  strategies  used  by  different  subjects.  These  results  also  lend 
support  to  a  three-phase  model  of  the  von  Restorff  effect. 
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Introduction 

Karis,  Fabiani,  and  Donchin  ( 1 Qft4 )  reported  that  people  who  used  rote 
rehearsal  strateaies  were  more  likely  to  recall  stimuli  that  had  elicited 
larae  P300s.  This  relationship  between  P3O0  amplitude  and  subseouent  recall 
was  not  observed  in  suhiects  who  used  elaborative  strateaies.  The  choice  of 
rehearsal  strateaies  in  the  Karis,  Fabiani,  and  Donchin  (1984)  experiment 
was  left  to  the  subjects  and  was  ascertained  only  in  a  post-experimental 
debriefina.  In  the  present  study  the  subjects'  strategies  were  manipulated 
by  instructions  and  each  subject  performed  the  task  usina  both  rote  and 
elaborative  rehearsal  strategies.  This  wi thin-subject  desian  is  necessary 
to  confirm  the  oriainal  report  by  Karis,  Fabiani,  and  Donchin  (1984). 

The  P3QQ  Component  of  the  FRP 

The  P3O0  component  of  the  FRP  was  first,  described  by  Sutton,  Rraren, 
7uhin,  and  John  (1968).  It  peaks  3D0  ms  or  more  after  the  eliciting  event, 
it  is  positive  at  all  the  midline  electrode  sites,  and  is  maximal  at  the 
parietal  electrode  (Pz;  10-2(1  International  System,  Jasper,  1988). 

The  P30P  is  elicited  only  by  events  that  are  relevant  to  the  task  the 
subject  is  performing,  and  its  amplitude  is  inversely  related  to  the 
subjective  probability  of  the  eliciting  event  (Duncan-Johnson  &  Donchin, 
1977;  Johnson  f>  Donchin,  1978).  The  dependence  of  P300  amplitude  on  these 
variables  suagests  that  the  process  manifested  by  P300  is  associated  with 
the  processino  of  novel  and  relevant  stimuli . 

One  of  the  critical  clues  to  the  functional  significance  of  the  P300  is 


that  the  processing  represented  by  P300  appears  to  be  used  in  the  service  of 
future  actions,  rather  than  in  t*e  execution  of  the  specific  responses  to 
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the  elicitina  event  (Oonchin,  197Q;  Donchin,  Gratton,  Duoree,  &  Coles,  in 
press;  Oonchin,  Ritter,  *  McCallum,  1Q7R;  Gehrinq,  Gratton,  Coles,  & 

Oonchin,  1°R6;  Kutas,  McCarthy,  *  Donchin,  1977;  Munson,  Ruchkin,  Ritter, 
Sutton,  A  Sauires,  lpR4).  For  instance,  Oonchin,  et  ai .  (in  press),  used  a 
choice-reaction  time  task  in  which  the  probability  of  the  stimuli  was 
manipulated  and  found  that  sub.iects  were  biased  in  favor  of  the  frequent 
response,  and  often  erred  when  a  rare  stimulus  was  presented.  However,  when 
a  laroe  p3Pn  was  emitted  after  an  incorrect  response  to  a  rare  stinulus,  the 
response  bias  in  the  followino  trials  was  reduced.  Similarly,  Gehrinq  et 
al .  (19R6)  observed  that  the  amplitude  of  P300  after  incorrect  responses  to 
rare  events  predicted  shifts  in  the  subjects'  response  strateqies. 

These  data,  and  data  from  studies  that  focused  on  cbanqes  in  P300 
amplitude  as  a  function  of  the  previous  seouence  of  stimuli  (Sauires, 
Wickens,  Sauires,  J  Donchin,  1 976 J1  and  on  variations  in  inter-stimulus 
interval  (Heffley,  19R1 ;  Fitzqerald  *  Picton,  l°8l),  are  consistent  with  the 
hypothesis  that  p300  is  the  manifestation  of  a  process  invoked  when  there  is 
a  need  to  revise  the  representations  in  workinq  memory  (or  "context 
updatinq";  Donchin,  19R1).  When  a  rare  event  occurs,  or  an  error  is  made, 
or  conditions  chanqe,  this  information  must  be  incorporated  into  coqnitive 
schemas.  These  schemas  will  qovern  both  the  perception  and  action  taken  on 
future  trials,  and  will  also  affect,  the  recall  of  the  information  of  the 
specific  trial . 
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The  Role  of  Novelty  in  Learning:  P300  and  Memory 

In  recent  years,  the  role  of  novelty  in  learninq  has  been  enphasized  by 
several  theories  of  animal  learninq  (MacKintosh,  1975;  Pearce  %  Hall,  19R0; 
Pescorla  *  Waqner,  1972;  Waaner,  1976;  197R;  see  also  Dickinson,  19R0,  for  a 
oeneral  review).  No  one  theory  has  yet  received  universal  acceptance,  but 
all  theorists  aooear  to  aqree  that  chanqes  in  a  stimulus  representation 
followinq  a  learninq  experience  depend  upon  the  extent  to  which  the 
predictor  and/or  the  Dredicted  event  are  jointly  Drocessed  in  workinq 
memory.  A  critical  condition  for  this  joint  processinq  is  that  the  stimulus 
should  be  surprisinq  or  unexpected.  Similar  theories  of  human  memory 
(Dhman,  1R7Q;  Sokolov,  1963;  1969;  1976)  have  also  arqued  that  a  short  term 
memory  is  used  to  maintain  an  internal  model  of  a  dynamic  environment,  and 
that  deviations  from  this  internal  model  reouire  an  updatinq  process. 

Rare  or  unexpected  events  should  lead  to  a  restructurina  or  updatinq  of 
the  current  memory  schemas,  because  only  in  this  way  can  an  accurate 
representation  of  the  environment  be  maintained.  The  updatinq  process  may 
involve  an  "activation"  of  the  memory  representation  of  the  event,  or  the 
"markina"  of  some  attribute  of  the  event  that  was  "distinctive",  and 
therefore  crucial  in  determinino  the  updatinq  process.  This  restructuring 
of  the  memory  representation  of  an  event  is  assumed  to  facilitate  its 
subsequent  recall,  by  providino  valuable  retrieval  clues,  so  that  the 
nreater  the  restructuring  that  follows  an  individual  event,  tht  higher  the 
probability  of  later  recalling  that  event.  P300  amplitude  is  assumed  to  be 
prooortionai  to  the  deqree  of  restructuri nq  of  the  memory  representation  of 
**e  »vent.  Therefore,  P300  amplitude  should  also  predict  the  subsequent 
1  r>f  The  eliciting  event. 
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The  Use  of  P3QO  i n  the  Investigation  of  Worki nq  Memo ry 

The  von  Restorff  study.  Kan's,  Fabiani,  and  Donchin  (1984)  used  the 
von  Restorff  paradigm  (von  Restorff,  1Q33)  to  test  the  hypothesis  that  P300 
amplitude  predicts  subsequent  recall.  They  presented  series  of  unrelated 
words  and  recorded  the  FRP  elicited  by  each  word.  After  each  series  the 
subjects  were  asked  to  recall  as  many  words  of  the  series  as  possible.  Most 
o*  the  series  contained  a  deviant  word  (an  "isolate"),  with  the  isolation 
achieved  bv  chancing  the  size  of  the  word.  When  one  item  in  a  series  is 
distinctly  di fferent  from  the  others  (e.a.,  because  of  color,  size,  meaning, 
or  class),  the  probability  that  it  will  be  recalled  increases.  The  label 
von  Restorff,  or  isolation  effect,  refers  to  the  enhanced  recall  of  an 
"isolated"  item,  with  respect  to  comparable,  non-isolated  items  (for  a 
review,  see  Cimbalo,  1°78;  Wallace,  1%8). 

Measures  of  the  magnitude  of  the  von  Restorff  effect,  of  the  qeneral 
recall  performance,  and  of  the  amplitude  of  the  P300  component  of  the  ERP 
were  computed  for  each  subject.  Strikino  individual  differences  emerqed  on 
all  measures,  and  subjects  were  placed  into  three  distinctly  different 
groups  according  to  the  magnitude  of  the  von  Restorff  effect  in  the  free 
recall.  In  group  1,  subjects'  overall  performance  was  low,  but  "isolating" 
a  word  by  changing  its  size  increased  recall  dramatically  (a  hiah  von 
Restorff  effect).  These  subjects  reported  using  primarily  rote  strategies 
(e.g.,  repeating  the  words).  It  was  only  in  these  subiects  that  Karis, 
cabiani,  and  Donchin  (1984)  observed  a  positive  relationship  between  P3D0 
amplitudA  and  recall.  At  the  other  extreme,  subiects  in  group  3  which 
showed  no  relationship  between  P3C0  and  recall,  exhibited  high  overall 
recall  oercentaoes,  and  no  isolation.  These  subiects  reported  using  complex 
elahorative  strategies  (e.g.,  making  up  sentences,  stories  or  imaoes).  The 
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amplitude  of  a  frontal-positive  slow  wave"3 was  related  to  subseouent  recall, 
and  the  elaborators  exhibited  more  evidence  of  this  component  than  the  rote 
memorizers.  Karis,  Fabiani,  and  Donchin  (1QP4)  speculated  that  the  slow 
wave  could  be  associated  with  the  heainninq  of  elaborative  organizational 
processes. 

The  intriouina  aspect  of  the  data  reported  by  Karis,  Fabiani,  and 
Donchin  (1984)  is  the  modulatinq  role  that  rehearsal  strategies  played  on 
the  relationship  between  P300  and  recall.  P300  provides  information  about 
the  coanitive  processina  of  an  event  occurring  durinq  the  first  second  after 
its  oresentation,  while  rehearsal  strategies  that  influence  recall  often 
continue  for  an  extended  period.  The  relationship  between  P300  and  recall 
will  thus  depend  on  the  nature  of  the  extended  mnemonic  processina  durinq 
rehearsal.  The  results  of  the  Karls,  Fabiani,  and  Donchin  (1984)  study  are 
consistent,  with  a  three-phase  model  of  the  processes  that  determine  the 
subjects'  recall  of  the  words.  In  the  first  phase,  in  which  the  stintili  are 
encoded  and  cateaorized,  the  subiects'  information  processina  is  independent 
of  their  subseouent  use  of  rehearsal  strategies.  We  assume  that  the  process 
mani fested  bv  the  P300  is  activated  by  the  words  that  are  presented  in  the 
distinct,  or  "Isolated,"  font.  Karis,  Fabiani,  and  Donchin  1 904 1  examined 
the  distribution  of  P30O  amplitude  in  all  subjects  and  found  no  difference 
between  the  different  aroups  of  subiects.  It  would  appear  that  all  subjects 
noticed  the  isolated  words  to  the  same  extent  and,  as  a  conseauence,  a  P300 
was  always  elicited  in  response  to  the  Isolates.  We  assume  that  the 
representations  of  the  isolated  words  were  "marked"  or  activated  in  some 
fashion,  and  this  is  reflected  by  P300  amplitude.  It  is  at  this  point  that 
the  suhjects  diverged  into  aroups.  In  the  second  phase  the  subjects  engaaed 
in  various  activities  designed  to  enhance  their  ability  to  recall  the 
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stimuli.  Some  subjects  opted,  when  left  to  their  own  devices,  to  adopt 
rote-rehearsal  strateqies,  while  other  suhjects  enaaaed  in  various  semantic 
elaboration  strateqies.  when,  in  the  third  phase,  the  subjects  were 
required  to  recall  the  words,  the  rote-memorizers,  we  proposed,  retrieved 
words  on  the  basis  of  the  initial  representations  created  durinq  phase  1. 
Some  of  these  representations  were  marked  or  activated  by  their  association 
with  P300  and  the  larqer  the  P300  the  stronaer  the  mark  or  the  activation 
and  the  more  likely  the  recall.  On  the  other  hand,  at  the  moment  ot 
retrieval  the  elahorators  relied  on  the  new  representations  they  created 
durina  the  elaboration  process,  and  these  representations  did  not  carry  the 
"marks"  associated  with  the  P300;  hence  the  lack  of  correlation  between 
recall  and  P300  amplitude  in  the  elahorators. 

An  Incidental  Memory  Experiment.  The  effect  of  rehearsal  strateaies  on 
the  P300-reca11  relationship  was  not  anticipated  as  we  desioned  the  study 
reported  by  Karis,  Fabiani,  and  Donchin  (198*1.  Thus,  no  control  was 
attempted  over  rehearsal  strateaies.  To  confirm  the  results  we  examined  the 
relationship  between  P300  and  recall  in  a  paradiqm  where  subjects  did  not 
have  any  reason  to  adopt  an  elaborative  strateay  (Fabiani,  Karis,  *  Donchin, 
19861 .  We  embedded  an  incidental  memory  task  in  another  study  (Fabiani, 
Gratton,  Karis,  and  Donchin,  in  press;  Karis,  Coles,  A  Donchin,  19841,  in 
which  subjects  were  presented  with  a  succession  of  oddball  tasks4,  none  of 
which  involved  recallinq  the  stimuli.  After  series  in  which  the  subject  was 
instructed  t.o  respond  di fferentially  to  tones,  or  to  count  either  the  letter 
"H"  or  "S,"  we  inserted  a  series  composed  of  randomly  mixed  male  and  female 
names,  and  instructed  the  subject  to  count  names  of  one  qender.  The 
subjects  bad  no  reason  to  expect  that  they  would  be  asked  to  recall  these 
names.  Therefore,  Fabiani  et  al .  (19861  assumed  that  they  would  not 
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develop,  and  use,  complex  elahorative  rehearsal  strategies  to  facilitate 
recall.  In  this  situation,  the  relationship  between  P300  and  recall  could 
be  evaluated  in  the  absence  of  elahorative  processes  occurrinq  after  P300. 
Our  main  prediction  was  that  names  that  were  recalled  would  elicit  laraer 
P300s  on  their  initial  presentation  than  names  that  were  not  recalled.  Our 
prediction  was  confirmed  and  no  consistent  differences  emerqed  amonq 
subjects,  thus  indicatinq  that,  when  elahorative  strateqies  are  prevented, 
the  amplitude  of  P300  predicts  subsequent  recall. 

The  Present  Study.  Althouah  the  Fabiani  et  al .  (19R6)  data  are 
consistent  with  the  predictions  derived  from  the  Karis,  Fabiani,  and  Donchin 
(1QR4)  study,  they  are  based  primarily  on  a  negative  finding;  namely,  that 
individual  differences  due  to  rehearsal  strategies  will  be  reduced  when  an 
incidental  memory  test  is  employed,  thus  allowing  for  the  P300/recall 
relationship  to  emerae  more  consistently.  However,  the  individual 
differences  observed  by  Karis,  Fabiani,  and  Donchin  (1984)  could  still  be 
attributed  to  the  subjects'  idiosyncrasies  rather  than  to  the  use  of 
particular  mnemonic  strateqies.  Therefore,  it  is  crucial  to  determine 
whether  recall  performance  is  influenced  by  the  interaction  between  the 
updating  process  (manifested  by  P300)  and  mnemonic  strategies  not  only 
between  but  also  wl thl n  subjects.  Thus,  we  devised  a  oaradiam  in  which  we 
directly  manipulated  strateoies  by  instructions.  The  subjects  were  run  in  a 
von  Restorff  paradigm,  similar  to  that  used  by  Karis,  Fabiani,  and  Donchin 
(1884).  Instructions  to  use  "rote”  strateoies  required  the  subject  to 
reoeat  each  word  as  it  was  presented,  while  "elaborative"  instructions 
required  the  subject  to  combine  words  into  imaoes,  sentences,  or  stories 
(both  sets  of  instructions  are  reported  in  the  Appendix).  We  expected  that 
the  same  subject  would  behave  like  a  rote  memorizer  of  the  Karis,  Fabiani, 
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and  OoncMn  (1984)  study  when  aiven  rote  instructions,  and  like  an 
elaborator  when  aiven  elaborative  instructions. 

This  study  also  allowed  a  test  of  predictions  derived  from  the 
"distinctiveness  hypothesis,"  which  has  been  invoked  to  explain  the  von 
Restorff  effect  (Rird,  1880),  as  well  as  the  enhanced  recall  of  distinctive 
items  (Eisher,  1981;  Hunt  and  Elliott,  1980;  Hunt  4  Mitchell,  1878;  1982; 
Schmidt,  1888;  Zechmeister,  1872).  Distinctive  features  of  an  item  are 
those  shared  by  few  other  items.  The  distinctive  features  of  a 
to-be- remembered  item,  beino  uniaue,  will  be  more  useful  in  retrieval  than 
common  features,  shared  by  many  other  items.  Thus,  the  distinctiveness 
hypothesis  focuses  on  the  utility  of  the  encoded  information.  However,  the 
utility  of  the  encoded  information  is  also  relative  to  the  context  in  which 
the  information  is  retrieved.  As  Fisher  (1980,  p.  310)  reports,  "a 
distinctive  encoding  operation  will  improve  retention  to  the  extent  that  the 
encodino  context  is  reinstated  at  retrieval."  In  the  present  experiment, 
the  deviant  size  of  the  word  can  be  considered  a  "distinctive"  attribute  of 
the  memory  representations  of  the  isolates.  However,  this  size  attribute 
should  only  be  useful  at  retrieval  after  the  subjects  have  used  rote 
rehearsal  strategies.  Therefore,  we  hypothesized  that  subjects  will  have  a 
better  memory  of  the  word  size  when  they  have  used  rote  strategies  than  when 
they  have  used  elaborative  strateaies.  To  test  this  hypothesis  we 
constructed  a  "size-recall  test." 
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Method 


Subjects 

Ten  riqht-handed  female  subjects  were  run  Individually  In  an  experiment 
which  extended  over  three  sessions.  All  the  subjects  were  underqraduate 
students  at  the  University  of  Illinois  (aae  ranee  «  IP  to  21,  median  = 

10. SI.  They  were  paid  S3. SO  per  hour,  with  a  S5  bonus  when  they  completed 
the  third  session. 

Data  Col  lection 

Aq-AqCl  Beckman  Biopotential  electrodes  were  affixed  alona  the  midline 
of  the  scalp  at  frontal,  central,  and  parietal  sites  (Fz,  Cz,  and  Pz)  by 
means  of  Grass  EC-?  electrode  cream.  The  subjects  were  qrounded  via  Ao-AqCI 
Beckman  Bioootential  electrodes,  affixed  to  the  forehead  by  means  of 
adhesive  collars.  The  same  type  of  electrodes  were  affixed  above  and  below 
the  riaht  eye  to  record  the  vertical  electroocul oaram  (EOG),  and  on  the 
mastoids.  Linked  mastoids  were  used  as  references.  Electrode  impedance  did 
not  exceed  10  KOhm.  The  EEG  was  amplified  with  Van  Goqh  Model  50000 
amplifiers  (time  constant  10  seconds,  upper  half-amplitude  frequency  35  Hz, 
3dB/octave  roll-off)  and  was  dioitized  at  the  rate  of  100  samples/sec. 

All  aspects  of  experimental  control  and  data  collection  were  controlled 
by  a  P0P-11/40  computer  system  Interfaced  with  an  Imlac  oraohics  processor 
(Oonchin  S  Heffley,  1975).  Averaae  waveforms  and  the  si nql e-trial  records 
were  monitored  on-line  uslnq  a  DEC  VT-11  display  processor.  Eye  movement 
artifacts  were  corrected  off-line  uslnq  a  procedure  described  by  Gratton, 


Coles,  and  Oonchin  (19*3). 

The  data  were  filtered  before  beinq  analyzed  by  uslnq  an  off-line 
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moving  average  (1  iteration)  corresoondi no  to  a  3.14  Hz  filter  (Ruchkin  A 
Glaser,  1Q7R). 

Word  Li sts 

Five  word  lists  were  constructed.  The  first  list  was  used  for  all  the 
subjects  durino  the  first  session  (list  F),  and  contained  40  series  of  15 
words.  The  remaining  4  lists  were  used  for  the  second  and  third  sessions  (a 
pair  per  session  -  Lists  A-R  and  C-D).  Each  of  these  lists  contained  20 
series  of  15  words.  Seventy-five  percent  of  the  series  contained  an 
isolated  word  (Isolated  series),  and  25%  did  not  (Control  series).  The 
isolated  word,  displayed  in  laroe  size,  occurred  at  random  anywhere  from 
position  6  through  10  of  each  Isolated  series. 

Words  in  each  list  were  selected  at  random  hy  a  computer  program,  from 
a  master  list  composed  of  all  the  actual  words  with  3  to  7  letters  in  Toglia 
and  Rattig  (1<>7R).  Each  word  could  appear  in  only  one  list,  so  that  it  was 
presented  no  more  than  once  to  each  subject.  The  computer  program  also 
determined,  at  random,  which  series  in  each  list  were  to  contain  an  isolate, 
and  the  position  of  the  isolate  (from  6  through  10). 

In  list  F  (the  list  used  for  the  first  session),  the  first  10  series 
were  used  for  the  "No  strategy  instruction"  condition.  Eight  of  the  series 
contained  an  Isolate.  The  next  15  series  were  used  for  the  "Rote  strategy" 
condition,  and  11  contained  an  isolate.  Finally,  the  last  15  series  were 
used  for  the  "Elahorative  strategy"  condition,  and  11  of  these  also 
contained  an  Isolate.  In  lists  A,  B,  C,  and  D  (used  for  the  second  and 
third  sessions),  15  of  the  20  series  contained  an  isolate.  The  pair  of 
lists  (A-P  or  C-D)  used  in  each  session  (second  or  third),  and  the  order  of 
strategy  instructions  were  counterbal anced  among  subjects. 
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Six  practice  series  were  also  constructed  accordinq  to  the  same  rules. 
Words  in  the  practice  series  were  not  included  in  any  of  the  experimental 
1  i st s .  Two  practice  series  (one  Isolated  and  one  Control)  were  used  for 
each  of  the  instruction  conditions. 

Words  in  each  series  were  presented  seauentially,  for  260  ms  each,  with 
a  2000  ms  interval  between  words.  Non-isolated  words  were  formed  by  12mm  X 
12mm  letters  (word  lenath  36mm  to  04mm,  visual  anqle  2.25  to  5.25  degrees). 
Isolated  words  were  laraer,  and  were  formed  by  20mm  t  20mm  letters  (word 
length,  60mm  to  140mm,  visual  anqle  3.75  to  ft. 75  deqrees).  Size  differences 
were  easily  discriminate,  and  all  the  subjects  reported  they  could  easily 
read  words  displayed  in  either  size  in  the  time  provided. 

Procedure 

The  subject  was  seated  in  an  air  conditioned  unshielded  room  in  front 
of  a  Hewlett  Packard  (HP)  CRT  display  ( #1 310A ) .  The  recording  and  control 
apparatus  were  located  in  an  adjacent  room. 

There  were  three  sessions,  with  ft  to  26  days  (median  =  17  days)  between 
the  first  and  second  session,  and  1  to  5  days  between  the  second  and  third 
session  (median  =  2  days).  The  first  session  was  a  baseline  and  traininq 
session.  For  this  reason  it  had  a  different  experimental  design,  and  there 
was  a  lonqer  interval  between  the  first  and  the  second  session  than  between 
the  second  and  the  third.  The  qeneral  procedure  and  instructions  were 
similar  to  those  used  by  Karis  Fabiani,  and  Donchin  (19ft4),  with  the 
exception  of  the  strateqy  instructions.  As  shown  in  Figure  1,  during  each 
session  the  subjects  were  presented  with  40  series  of  15  words  each,  and 
were  asked  to  memorize  as  many  words  as  they  could  and  to  write  them  down  at 
the  end  of  each  series  (free  recall  task). 
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Insert  Fioure  1  about  here 


The  suh.iect  was  qiven  a  clipboard  and  recall  sheets  (one  per  series)  on 
which  to  write  the  words  after  each  15-word  series  was  completed.  A 
5-second  pause  was  interposed  at  the  end  of  each  series,  durinq  which  the 
suh.iect  was  instructed  to  aet  the  recall  sheet  for  that  series.  At  the  end 
of  this  pause  a  small  liaht  attached  to  the  clipboard  was  turned  on, 
si  anal  i  no  the  suh.iect  to  pick  up  the  pen  and  start  writing.  Removal  of  the 
pen  from  its  holder  activated  a  switch  monitored  by  the  experimenter.  This 
ensured  that  the  suh.iect  could  not  beoin  writina  prematurely.  Fifty-five 
seconds  were  provided  for  the  free  recall,  and  all  sub.iects  reported  that 
this  interval  was  sufficient.  The  writina  liqht  was  then  turned  off  to 
indicate  that  the  recall  period  had  ended.  After  a  warnino  ("ready?")  from 
the  experimenter  (via  an  intercom),  another  series  was  presented. 

In  the  fi rst  session  (baseline  and  training  session) ,  ERPs  were  not 
recorded.  Two  practice  series  were  aiven  to  the  subject  at  the  beqinninq  of 
the  session.  The  first  one  contained  an  isolated  item,  while  the  second  was 
a  control  series.  After  the  isolated  series,  the  subjects  were  asked  if 
they  had  noticed  that  one  word  was  larqer  than  the  others,  and  were  told 
that  occasionally  a  word  would  appear  larqer,  but  that  they  should  attend  to 
all  the  words,  and  ignore  size  differences  among  words.  No  strateqy 
instructions  were  qiven  to  the  subject  before  the  first  qroup  of  10  series. 
At  the  end,  the  subjects  were  interviewed,  and  this  provided  Information 
about  their  spontaneous  use  of  mnemonic  strategies.  The  next  15  series  were 
preceded  hv  Instructions  to  use  rote  strategies,  and  by  two  additional 
practice  series.  The  final  15  series  were  preceded  by  elahorative  strategy 
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instructions  and  by  two  more  practice  series.  This  manipulation  provided 
information  about  the  subject's  ability  to  use  both  strateqies,  and  ensured 
that  all  the  subjects  would  start  the  experimental  sessions  with  the  same 
familiarity  with  both  strateaies.  The  subjects  were  given  approximately  5 
minutes  rest  after  the  first  10  series,  and  aoain  after  the  next  IS  series. 

In  both  the  second  and  thi rd  sessi on  ( experimental  sessions)  ERPs  were 
recorded  to  each  word,  and  subjects  were  instructed  to  use  one  strateoy 
(either  rote  or  elaborative)  durinq  the  first  half  of  the  session  (20 
series),  and  the  other  durina  the  second  half  (20  series).  The  order  of 
instructions  was  counterbalanced  across  sessions  and  subjects.  The  subjects 
were  qiven  short  (approximately  five  minutes)  rest  periods  after  every  ten 
series.  At  the  end  of  each  block  of  series  memorized  usinq  the  same 
strateoy,  the  subject  was  asked  to  describe  the  strateoies  she  used. 

In  addition  to  the  free  recall  task,  the  third  session  included  a 
countinq  task  (oddball  task),  and  a  size-recall  test.  The  oddball  task  and 
the  size-recall  test  were  unexpected.  The  size-recall  test  is  described  in 
more  detail  below.  The  oddball  task  will  not  be  further  discussed,  because 
it  served  mainly  as  a  "filler"  task  between  the  free-recall  and  the 
si ze-recall . 

The  EEG  data  were  acauired  whenever  a  stirolus  word  was  presented  on 
the  HP  screen  (i.e.,  dlirinq  the  free  recall  and  the  oddball  tasks).  For  the 
free  recall,  the  stimulus  duration  was  250  ms,  the  inter-stimulus  interval 
was  2000  ms  and  the  recording  epoch  was  also  2000  ms,  beoinnino  100  ms  prior 


to  stimulus  onset 
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Strategy  Instructions  for  the  F ree  Recal 1  Task .  The  strateqy 
instructions  were  the  crucial  independent  variable  in  this  experiment.  To 
test  the  effectiveness  of  our  strateay  instructions  we  ran  a  pilot  study 
with  ?6  subiects.  The  subiects  were  divided  in  two  groups,  and  the  members 
of  each  group  were  run  together  (no  ERPs  were  recorded  in  this  session). 
Eleven  series  of  15-words  each  were  read  out  loud  by  D.  K. ,  at  the  rate  of  1 
word  every  two  seconds.  The  readinq  rate  was  maintained  constant  with  the 
help  of  a  metronome.  A  5-second  pause  followed  the  readinq,  and  then  5D 
seconds  were  allowed  for  free  recall.  Refore  the  first  three  series,  the 
subjects  were  instructed  to  recall  as  many  words  as  they  could,  and  no 
Darticular  mnemonic  strateqy  was  suggested.  Then,  before  the  next  four 
series,  instructions  were  given  to  use  either  rote  or  elaborative 
strategies.  Finally,  the  subiects  were  instructed  to  use  the  other  strateqy 
before  the  last  four  series.  The  order  of  instructions  was  counterbalanced 
between  the  two  groups  of  sub.iects.  Note  that  the  interest  of  this  pilot 
study  was  to  test  the  effectiveness  of  our  instructions  in  manipulating 
recal 1  performance.  Therefore,  no  isolated  items  were  included  in  the 
li sts. 

We  found  that  our  strategy  instructions  were  effective  in  influencing 
recall  performance.  The  subiects  recalled,  on  average,  51%  of  the  words 
when  no  strategy  instructions  were  given.  They  recalled  42%  of  the  words 
under  rote  Instructions,  and  55%  under  el aboraiti ve  instructions.  The 
difference  between  rote  and  elaborative  instructions  was  significant, 

F( 1 , 26)  *  25.67,  p  <  0.0001. 

In  order  to  minimize  variations  from  session  to  session  and  from 
subject  to  subject  the  instructions  were  recorded  on  tape.  The  subjects 
were  told  that  they  could  interrupt  the  tape  if  they  had  Questions,  and  were 
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debriefed  at  the  end  of  the  recording  to  make  sure  they  understood  the 
instructions  correctly.  Rote  strategy  instructions  reauired  the  subjects  to 
repeat  the  words  silently  to  themselves,  in  any  way  they  chose.  El aborati ve 
strategy  instructions  required  the  subjects  to  connect  or  organize  the 
words,  by  makinq  sentences,  or  forminq  imaqes  or  pictures  with  them.  The 
strategy  instructions  and  the  practice  series  were  repeated  in  each  session, 
as  a  reminder  to  the  subjects,  and  as  a  way  to  keep  the  sessions  as  standard 
as  possible.  The  strategy  instructions  are  presented  in  the  Appendix. 

Even  though  our  instructions  had  proven  effective  in  the  nilot  study, 
we  felt  that  an  onooi nq  report  on  the  subjects'  use  of  strategies  would  be 
useful,  in  order  to  soot  temporary  shifts  in  strategy  use.  Therefore  we 
added  an  additional  check  of  the  subjects'  use  of  strategies,  nn  too  of 
each  recall  sheet  there  were  two  boxes,  one  with  a  "Y"  and  one  with  an  "N." 
The  suhlects  were  asked  to  indicate,  by  checkinq  one  of  the  boxes  for  each 
15-word  series,  whether  or  not  they  had  been  able  to  use  the  reouired 
strategy.  In  session  2  and  3  only  those  series  for  which  the  subjects 
reported  (bv  marking  the  "Y"  box)  having  used  the  strategy  requi red  by  the 
instructions  were  used  in  the  computation  of  the  recall  performance  and  the 
von  Restorff  index  (a  measure  of  the  magnitude  of  the  von  Restorff  effect). 
For  the  10  subjects,  there  were  31  series  out  of  800  presented  in  which  the 
"N"  box  was  marked  (3.9%).  Eiqhteen  of  them  were  series  for  which 
elaborative  instructions  had  been  given  (A.^%  of  the  series  presented  under 
el aborati ve  instructions)  and  13  of  them  were  series  presented  under  rote 
instructions  (3.3%  of  all  the  series  presented  under  rote  instructions). 

Five  of  the  10  subjects  did  not  check  any  "N"  box.  Of  the  remaining  5 
suhlects  only  one  (subject  3)  checked  a  hiqh  number  of  "N"  boxes  (18 
series).  Note  that  the  number  of  lists  checked  "N"  by  this  subject  amounts 
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to  more  than  half  of  the  total  number  of  series  checked  "N"  by  the  entire 
qrouD  of  10  subjects  (31  series!.  The  18  "N"  series  marked  by  subject  3 
correspond  to  ??.Ft  of  all  the  series  presented  to  her  ( 2 8%  under 
elaborative  instructions  and  RfH  under  rote!.  It  is  also  interestinq  to 
note  that,  for  the  "N"  series,  the  overall  recall  performance  of  this 
suhiect  was  hiaher  for  rote  (^St)  than  for  elaborative  U0%!  instructions, 
thus  sunoestina  that  the  subject  was  indeed  usinq  the  opposite  strateay  from 
the  one  suooested.  Pehriefino  confirmed  this  interpretation.  For  the 
-emainina  five  suhiects  the  percentaoe  of  lists  excluded  never  exceeded  8% 
of  the  total  number  of  lists  presented  in  the  two  experimental  sessions. 

C;ize-Decaii  Test.  In  order  to  determine  whether  subjects  could  recall 
which  words  were  isolated,  we  devised  a  size-recall  test.  The  subject  was 
presented  with  a  orinted  list  of  all  the  isolates  presented  in  the  third 
session  'half  under  -ote  and  half  under  elaborative  instructions),  randomly 
intersperse*  with  an  eoual  number  of  non-isolated  words  (half  from  each 
strateav  instruction!.  All  words  in  the  list  were  of  course  orinted  in  the 
same  size  font.  Isolates  and  non-isolates  presented  in  the  size-recall  test 
were  matched  for  position.  The  subject  was  told  to  indicate  whether  or  not 
each  word  had  been  onqinallv  displayed  in  laroe  size.  An  oddball  task  was 


interposed  between  the  free  recall  task  and  the  size-recall  test. 
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Resul  t.s 

Data  from  the  first  session  (baseline  and  traininq  sessions)  were 
analyzed  separately  from  those  from  the  second  and  third  session 
(experimental  sessions).  Data  from  the  second  and  third  session  were  also 
examined  separately  at  first,  to  make  sure  that  there  were  no  siqnificant 
order  effects  (details  are  presented  below).  Since  the  data  obtained  in 
these  two  sessions  were  comparable,  they  were  combined  for  all  the 
subseauent  analyses. 

Analysis  of  Pecal 1 

As  in  the  Karis,  Fahiani,  and  Donchin  (19RA)  study,  we  computed  two 
indices  to  summarize  the  subjects'  performance  in  the  free  recall  task:  a 
measure  of  the  maqnitude  of  the  von  Pestorff  effect  (von  Restorff  Index,  or 
VRI)  and  an  index  of  the  overall  recall  performance  (P).  Both  indices  were 
computed  usinq  the  words  the  subject  recalled  durinq  the  free  recall  tests. 
The  two  indices  were  computed  separately  for  each  strateqy  condition.  VRI 
and  P  were  computed  as  follows: 

VRI  =  the  percentaae  of  isolated  words  recalled  (position  6-10) 
minus  the  percentaqe  of  non-isolated  words  recalled 
(position  6-10) 

P  *  overall  percentaoe  of  words  recalled  from  all  positions 
(isolates  and  non-isolates) 

Non-isolated  words  from  both  Isolated  and  Control  series  were  used  to 
compute  the  VRI.  Only  words  oriainaily  presented  in  position  6  throuoh  10 
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were  used  in  the  computation  of  the  VR I  (in  order  to  match  the  positions  of 
isolates  and  non-i sol ates) .  In  the  computation  of  the  overall  recall 
performance  all  the  words  were  used. 

First  Session.  The  VRI  and  P  for  each  subject  in  the  first  session  as 
well  as  means  and  SDs  for  each  instruction  condition  are  presented  in  Table 
1. 


Insert  Table  1  about  here 


When  strateqies  were  not  manipulated,  there  was  little  variability  in  the 
overall  performance  of  the  subjects,  less  than  that  reported  in  the  Karis, 
Fabiani,  and  Donchin  (1984)  study.  On  the  other  hand,  the  VRI  was  highly 
unstable  in  this  condition,  perhaps  because  it  was  computed  from  only  8 
isolates,  with  a  difference  in  the  recall  of  one  isolate  accounting  for  a 
12%  difference  in  recall. 

Roth  performance  and  the  VRI  were  affected  when  strateqy  instructions 
were  aiven  to  the  subjects.  Subjects  recalled  significantly  fewer  words  for 
rote  than  for  elaborative  strategies,  F(l,9)  =  30.65,  p  <  .001.  The  VRI  was 
higher  for  rote  strategies  than  for  elaborative,  even  thouah  this  difference 
did  not  reach  significance,  F{1,9)  =  4.6R,  p  =  .069. 

Second  and  Third  sessions.  The  performance  and  VRI  of  each  individual 
subject  in  session  2  and  3,  as  well  as  the  mean  and  SD  for  each  strategy 
instruction  are  reported  in  Table  2. 


Insert  Table  2  about  here 


The  two  strategy  instruction  conditions  differed  significantly  in  both 
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performance  and  VRI.  Recall  performance  was  lower  for  rote  than  for 
elaborative  instructions ,  F(l,9)  =  18.84,  p  <  .002.  The  VRI  was  hiqher  for 
rote  than  for  elaborative  instructions,  F(l,9)  =  If. 54,  p  <  .003. 

Table  2  reveals  that  only  one  subject  (subject  8)  showed  a  lower  VRI 
under  rote  than  under  elaborative  instructions.  Given  that  the  behavior  of 
this  subject  was,  for  some  reason,  different  from  that  of  the  other 
subjects,  the  ERP  waveforms  of  this  subject  might  reflect  different 
processes  from  those  of  the  other  subjects.  Therefore,  the  analyses 
presented  in  the  following  paraoraphs  were  performed  on  the  nine  remaining 
subjects.  However,  the  analyses  were  also  performed  on  the  full  sample  of 
ten  subjects  and  any  differences  will  be  discussed. 

The  serial  position  curves  averaaed  over  nine  subjects  are  shown  in 
Fiaure  ?a  for  rote  strateaies  and  in  figure  2b  for  elaborative  strategies. 


Insert  Figure  2  about  here 


These  curves  show  the  averaqe  percentage  of  words  recalled  in  session  2  and 
3  for  each  of  the  15  positions.  Data  for  isolated  and  non-isolated  words 
are  plotted  separately.  Note  that  the  isolates  are  represented  as  a 
horizontal  line,  which  actually  indicates  the  average  percent  recall  of  the 
isolates  from  position  6  through  10.  This  was  done  because  there  were  too 
few  words  to  plot  every  position.  For  both  strategies  there  is  a  "primacy," 
as  well  as  a  "recency"  effect  in  recall  performance.  The  magnitude  of  the 
von  Restorff  effect  in  each  condition  is  represented  by  the  higher  elevation 
of  the  curve  for  the  isolated  items  relative  to  that  for  the  non-isolates  in 
comparable  positions  (6  through  10).  Clearly,  the  von  Restorff  effect  is 
larae  under  rote  instructions  (Figure  2a),  while  it  is  almost  absent  under 
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elahorative  instructions  (Fioure  2h).  Performance,  on  the  other  hand,  is 
biaher  under  elahorative  than  under  rote  instructions. 

Fioure  3  compares  the  results  obtained  in  the  present  study  with  the 
results  reported  by  Karis,  Fahiani,  and  Donchin  (19R4). 


Insert  Fioure  3  about  here 


The  averaoe  von  Restorff  index  is  plotted  in  Fioure  3  aoainst  the  averaoe 
overall  performance  for  each  strateay  condition.  The  circles  represent  rote 
strateaies,  and  the  trianoles  elahorative  strateaies.  The  means  from  the 
Karis,  Fahiani,  and  Donchin  (1RR4)  study  are  connected  by  a  solid  line.  The 
means  for  the  subjects  in  the  present  study  are  connected  by  a  dashed  line. 
Note  that  the  means  from  the  Karis,  Fabiani,  and  Donchin  (1984)  study  were 
computed  on  two  different  arouos  of  subjects,  who,  durinq  a 
oost-experimental  debriefinq,  reported  havinq  used  different  strateoies, 
while  in  the  present  study  the  same  subjects  performed  in  both  conditions. 
The  two  studies  yield  very  similar  data,  even  thouqh  the  difference  between 
mean  values  is  smaller  in  the  present  study  than  was  in  the  Karis,  Fabiani, 
and  Donchin  (1984)  study.  However,  it  is  important  to  note  that,  as  in  that 
study  the  subjects  chose  the  strateoies  they  preferred,  one  mioht  expect 
more  extreme  values  than  in  the  present  study. 

From  the  data  reported  so  far  we  conclude  that  our  strateay 
manipulation  was  successful  in  affectinq  both  recall  performance  (more  words 
were  recalled  by  usina  elaboratlve  strateqies)  and  the  VRI  (which  was  hiqher 
under  rote  instructions),  in  the  manner  predicted  by  the  Karis,  Fabiani,  and 
Donchin  (1QR4)  model.  In  the  section  on  ERP  waveforms  we  will  examine  the 
interaction  between  the  strateq.v  instructions  and  the  amplitude  of  P300. 
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Session  Effects.  The  data  reported  above  were  combined  across  the 
second  and  the  third  session.  The  data  were  also  examined  separately  for 
the  two  experimental  sessions,  and  yielded  comparable  results.  There  was  a 
main  effect  of  strategy  instructions  on  performance.  In  both  sessions 
subjects  recalled  more  words  when  using  elaborative  strategies  than  when 
usino  rote,  E(1,Q)  =  17.24,  p  <  .003.  There  was  also  a  main  effect  of 
strateay  instructions  on  the  VRI:  in  both  sessions  the  VRI  was  higher  for 
rote  than  for  elaborative  instructions,  F(l,°)  *  16.37,  p  <  .003.  In 
addition,  there  was  a  main  effect  of  session  on  performance,  with  subjects' 
overall  performance  improving  from  session  2  (48%)  to  session  3  (53%), 

F(1,R)  =  7.05,  p<  .03.  However,  there  were  no  significant  interactions 
between  session  and  strategy  instructions,  thus  supporting  the  legitimacy  of 
combining  the  values  from  the  two  sessions. 

Analysi s  of  ERP  Waveforms 

The  ERP  data  for  each  subject  were  combined  across  the  two  experimental 
sessions  (session  2  and  3). 

Isolation  Effect.  In  order  to  determine  whether  isolates  elicited 
larger  P300s  than  non-i solates,  and  whether  isolates  and  non-isolates 
elicited  similar  waveforms  for  the  two  strategy  instruction  conditions,  we 
first  compared  the  waveforms  elicited  by  the  isolates  with  the  waveforms 
elicited  by  the  non-isolates  in  the  same  positions  (6  through  10).  The  EEG 
records  of  each  subject  were  sorted  according  to  strategy  instructions  (rote 
or  elaborative),  word  class  (isolates,  non-isolates  in  isolated  series  and 
non-isolates  in  control  series)  and  position  (position  6-10  only).  The  ERP 
waveforms,  averaged  over  all  the  subjects  at  three  electrode  locations  (Fz, 
Cz,  and  Pz)  are  plotted  in  Figure  4.  in  this  figure,  non-isolates  coming 
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from  series  containing  an  isolate  and  from  control  series  are  plotted 
separately. 


Insert  Figure  4  about  here 


A  positive  peak  is  visible  in  all  the  waveforms.  We  label  this 
positive  peak  "P300"  because  its  peak  latency  (in  excess  o*  300  ms)  and  its 
scalp  distribution  are  characteristic  of  the  P300  component  of  the  EPP. 
Amplitude  values  are  positive  at  the  three  electrode  locations,  with  Pz  more 
positive  than  Cz,  and  Cz  more  positive  than  Fz.  Isolates  elicited  larger 

P300s  than  non-isolates,  while  non-isolates  from  isolated  and  control  series 

elicited  very  similar  waveforms.  This  result  is  consistent  with  the  general 
observation  that  task  relevant,  distinct  stimuli  elicit  a  larger  P300  than 
do  companion  stimuli  that  are  common  (e.g.,  Duncan-Johnson  A  Donchin,  1977 ) . 

It  can  also  be  noted  that  the  waveforms  for  rote  and  elaborative 

instructions  are  similar. 

Memory  Effect.  Given  that  isolates  did  elicit  P300s  we  proceeded  to 
determine  if  there  was  the  expected  relationship  between  the  strategy 
instructions,  the  amplitude  of  the  P300  and  the  subseauent  recall.  For  this 
analysis  the  EEG  records  of  each  subject  were  sorted  for  averaging  on  the 
basis  of  strategy  instructions  (rote  or  elaborative),  word  type  and  position 
(isolates,  non-isolates  in  position  6-10,  non-isolates  in  other  positions) 
and  subseouent  recall  (recalled,  not- recalled). 

ERP  waveforms,  averaged  over  all  subjects  at  Fz,  Cz  and  Pz  for  isolates 
recalled  and  not  recalled  are  presented  in  Figure  f  for  both  strategy 


instructions. 
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Insert  Fioure  5  about  here 


We  note  two  primary  aspects  of  these  waveforms.  First,  under  rote 
instructions  there  is  a  larqer  difference  in  P300  between  isolates  recalled 
and  not  recalled  than  under  elaborative  instructions;  this  is  most  evident 
at  the  central  and  parietal  electrodes.  Second,  there  is  a  difference 
between  isolates  recalled  and  not  recalled  at  the  frontal  electrode.  This 
is  most  evident  for  elaborative  instructions,  and  suqaests  the  presence  of  a 
frontal -positive  slow  wave. 

These  impressions  were  supported  by  means  of  statistical  analysis  on 
measures  of  P3O0  amplitude,  and  of  area  measures  of  the  frontal -positive 
slow  wave,  taken  on  the  waveforms  of  each  individual  suhiect.  The  amplitude 
of  P300  was  estimated  by  usinq  a  cross-covariance  procedure  (Fabiani  et  al . , 
in  press).  First,  the  P300  component  was  identified  at  Pz  as  the  seqment 
with  maximal  cross-correlation  with  a  template  (2  Hz,  1  cycle,  inverted 
cosinusoidal  wave),  in  a  time  window  ranqino  from  350  to  000  ms.  Next,  P300 
amplitude  was  assessed  by  computino  the  covariance  between  the  template  and 
the  seqment  of  Pz  waveform  Identified  as  P300.  Fabiani  et  al.  (in  press) 
demonstrated  that  the  use  of  cross-covariance  is  particularly  appropriate 
when  relatively  few  trials  are  entered  in  the  computation  of  the  averaqes. 

An  analysis  of  variance  was  applied  to  the  P300  amplitude  measures  to  test 
the  differences  In  amplitude  over  different  experimental  conditions.  A 
repeated  measures  desiqn  was  used  (ALICE  statistical  package,  proqram 
"ANOVA,"  Grubin,  Rauer  and  Walker,  1976).  This  analysis  confirmed  that 
isolated  words  elicited  larqer  P300s  than  other  word  types  (main  effect  of 
word  type),  F(14,ll?)  *  3.9R,  p  <  .0001.  In  addition,  words  that  were 


P300  AND  MEMORY 


26 


recalled  under  rote  instructions  (reqardless  of  their  type,  and  position) 
elicited  laraer  P300s  than  words  not  recalled,  while  there  was  no  difference 
between  words  recalled  and  not  recalled  under  elaborative  instructions 
(strateay  x  memory  interaction),  F(1,F)  =  5.72,  p  <  .05?  Note  that  this 
effect  is  present  for  all  the  words,  not  lust  the  isolates.  The  mean  P300 
amplitudes  (in  arbitrary  units)  are  presented  in  Table  3a  for  the  isolates 
and  in  Table  3b  for  all  the  words.  Note  that,  even  thouqh  the  amplitude  of 
P300  seems  to  be  sliqhtly  larqer  for  the  elaborative  instruction  condition, 
that  result  is  not  sianificant,  (F(1,R)  =  3.17,  p  =  .11). 


Insert  Table  3  about  here 


The  amplitude  of  the  P300  in  the  averaoe  waveforms  for  the  non  isolated 
words  appears  quite  small  (see  fiqure  4).  These  may  be  attributed,  in  part, 
to  subject-to-subject  variability  in  the  latency  of  P300.  Therefore,  in  the 
analysis  presented  above,  we  assessed  the  amplitude  of  P300  taking  into 
account  the  inter-subject  latency  variability.  We  also  computed  average 
waveforms  over  all  the  subjects,  where  the  waveforms  from  each  single 
subjects  were  shifted  so  that  the  P300  peaks  were  aligned.  These  waveforms 
are  shown  in  Figure  6  for  both  strateqy  conditions.  An  amplitude  difference 
between  words  recalled  and  not  recalled  is  noticeable  under  rote,  but  not 
under  elaborative  instructions? 


Insert  Figure  6  about  here 


As  we  mentioned,  a  positivity  at  the  frontal  electrode  was  evident  in 


the  waveforms  elicited  by  the  words  subsequently  recalled.  This 
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"frontal -positive  slow  wave"  is  more  Dositive  frontally  than  centrally  and 
parietal  ly  and  it  is  more  visible  when  subjects  are  qiven  elaborative 
strateay  instructions.  To  measure  this  slow  Dositivitv  we  measured  the  area 
under  the  curve  in  the  ERP  for  Fz  for  each  subject,  word  type  and  condition, 
in  a  time  window  ranaino  from  800  to  1180  ms.  An  area  measure  was  chosen  in 
this  particular  case  because  of  the  absence  of  a  clearly  defined  peak  for 
this  component. 

An  analysis  of  variance,  with  the  same  design  used  for  the  P300 
amplitude  measures,  was  applied  to  the  area  measures  of  the  frontal-positive 
slow  wave.  This  component  was  laraer  when  subjects  were  instructed  to  use 
elaborative  strateoies  than  when  they  were  instructed  to  use  rote  strategies 
(main  effect  of  strateqy)  F(l,8)  =  13.09,  p  <  .01.  This  component  was  also 
larqer  for  recalled  than  not  recalled  words  (main  effect  of  memory),  F(1,R) 

=  57.00,  p  <  .001. 

Thus,  the  analyses  of  the  ERP  waveforms  confirm  our  predictions:  the 
isolates  elicit  larqer  P300s  than  the  non-isolates  in  both  strateqy 
conditions.  In  addition,  the  relationship  between  P300  amplitude  and  memory 
emeroes  only  when  rote  rehearsal  strateoies  are  used.  Finally,  a 
frontal -positive  slow  wave  is  elicited  when  the  subjects  enqaqe  in 
elaborative  processing,  and  the  amplitude  of  this  component  is  also  related 
to  subsequent  recal  1 . 

Analysis  of  the^  Size-Pecall  Test 

The  size-recall  test  was  devised  to  test  the  hypothesis  that  the 
subjects  would  have  a  better  memory  of  the  si ze  at  which  the  word  was 
orioinally  displayed  when  the  words  had  been  memorized  under  rote  than  under 
elaborative  instructions.  Even  though  the  subjects  were  told  that  there 
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were  an  egual  number  of  words  originally  displayed  in  reaular  and  larae  size 
on  the  printed  list  they  received,  most  subjects  were  biased  toward 
responding  "reaular"  rather  than  "larae."  Therefore,  an  unbiased  measure  of 
accuracy  was  used  in  order  to  test  our  hypothesis.  The  accuracy  in 
identifyino  words  orioinally  displayed  as  larae  (isolates)  was  qiven  bv: 

Percentaae  of  larae- size  words  correctly  identified 
divided  by  the  total  number  of  "larae"  responses  aiven. 

The  accuracy  in  identifying  words  originally  displayed  in  "reqular"  size 
(non-i solates)  was  computed  in  an  analoaous  fashion  by: 

Percentage  of  regular-size  words  correctly  identified 
divided  by  the  total  numoer  of  "reaular"  responses  aiven. 

These  accuracy  indices  were  computed  separately  for  the  two  strateqy 
instruction  conditions.  Dur  hypothesis  was  confirmed:  subjects  were  more 
accurate  in  recalling  the  oriainal  size  of  the  words  presented  to  them  under 
rote  instructions  (72%)  than  under  elaborative  instructions  (58%)  -  main 
effect  of  strateay,  F(l,8)  =  7,28,  p  <  .03.  The  subjects  also  tended  to  be 
more  accurate  in  identifying  large  words  (isolates)  than  regular-size  words 
( non-i sol ates) ,  even  thouah  this  result  was  not  significant  (71%  of  the 
isolates  vs.  59%  of  the  non  isolates)  -  F(l,9)  =  4.81,  p  =  .06. 
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Discussion  and  Conclusions 

The  results  of  this  study  confirm  the  predictions  of  the  Karis, 

Fabiani,  and  Donchin  (1RR4)  model.  Our  strateay  manipulation  was  successful 
in  affectina  both  recall  performance  (more  words  were  recalled  bv  usina 
elaborative  strateaies  than  by  usina  rote)  and  the  VRI  (which  was  hiqher 
under  rote  instructions  than  under  elaborative).  The  isolates  elicited 
1 aroer  P300s  than  the  non-isolates  in  both  strateqy  conditions,  thus 
reinforcinq  the  claim  that  the  initial  processinq  of  the  items  is  the  same 
for  all  subjects,  independently  of  what  type  of  rehearsal  strateqy  is 
subsequently  used.  The  relationship  between  P300  amplitude  and  recall 
emeraed  only  when  rote  rehearsal  strateaies  are  used.  On  the  other  hand,  a 
frontal-nositive  slow  wave  was  elicited  when  the  subjects  were  enoaaed  in 
elaborative  processina,  and  the  amplitude  of  this  component  was  also  related 
to  subsequent  recall.  It  is  important  to  note  that  the  predictions  derived 
from  the  model  were  confirmed  not  only  for  the  isolated  words,  but  for  other 
words  as  well . 

Results  of  the  size-recall  test  suqqest  that  the  word  size  is  a 
"distinctive"  attribute  of  the  memory  representation  of  the  isolated  word. 
Subjects  were  sianificantly  more  accurate  in  identifyinq  the  size  of  the 
words  presented  to  them  under  rote  than  under  elaborative  instructions.  It 
is  plausible  that  under  both  strateay  instructions  the  size  attribute  was 
"marked"  as  distinctive  when  an  isolate  appeared  (a  larqer  P300  to  the 
isolates  than  to  the  non-isolates  was  observed  for  all  the  subjects). 
However,  when  the  words  were  memorized  under  rote  instructions,  the  subjects 
may  have  also  used  the  size  attribute  in  their  retrieval  search.  When  they 
were  instructed  to  use  elaborative  strateqies,  however,  thev  were  able  to 
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use  their  stories  or  imaaes  as  an  aid  to  retrieval,  and  did  not  "rehearse" 
the  size  attribute,  thus  havina  a  poorer  memory  for  it. 

Since  our  oriainal  experiment  (Karis,  Fabiani,  A  Donchin,  1984)  several 
studies  investi aat.inq  the  relationship  between  ERP  components  and  memory 
have  been  reported  (e.o.,  Fabiani,  et  al.,  19R6;  Johnson,  Pfefferbaum,  A 
Kopel 1 ,  1985;  Neville,  Idu ta s ,  Chesney,  A  Schmidt,  1986;  Paller,  l<litas,  A 
Mayes,  lq85).  These  studies  differed  in  several  respects  from  the  Karis, 
Fabiani,  and  Donchin  (1984)  study.  In  fact,  Fabiani  et  al .  (lq86)  used  a 
name  oddball  paradiom,  followed  by  an  incidental  free  recall  task.  Neville 
et  al .  (lq8fi)  presented  the  sub.lects  with  statements  followed  by  words  that 
could  be  either  semantically  conaruous  or  inconqruous  with  the  precedi  no 
statement,  and  used  an  incidental  recognition  to  test  memory  for  the  words. 
Paller  et  al .  (1985)  used  both  incidental  recall  and  incidental  recoqnition 
to  test  memory  for  words  that  were  previously  cateaorized  on  the  basis  of 
their  meanina  or  orthoqraphy.  Finally,  Johnson  et  al .  (1985)  used  a 
study-test  paradiom,  in  which  each  study  list  was  presented  to  the  subjects 
four  times,  and  was  followed  each  time  by  a  recoqnition  test  to  assess 
performance.  Ry  and  larqe,  these  studies  have  confirmed  that  a  larqe 
parietal  positivity  is  related  to  subsequent  memory  performance,  when 
conditions  similar  to  those  of  the  Karis,  Fabiani,  and  Donchin  (1984)  study 
were  employed.  When  siqnificant  variations  were  introduced  in  the  study, 
the  results  were  more  ambiouous.  For  instance,  i nvesti qators  usually  do  not 
examine  the  subjects  rehearsal  and  retrieval  strateoies.  Given  the  very 
powerful  effects  that  these  strateoies  have  on  the  relationship  between  P300 
and  recall,  it  is  evident  that  studies  in  which  this  factors  are  not  qiven 
consideration  may  yield  conflictino  results. 

It  is  also  important  to  emphasize  that  we  are  not  reportinq  a  qeneral 
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relationship  between  P300  and  memory.  The  Karis,  Fabiani  and  Donchin  (1Q841 
data  are  sometimes  interpreted  as  if  such  a  qeneral  claim  is  made.  The 
failure  to  confirm  this  putative  universal  relationship  between  P300  and 
memorv  is  then  considered  relevant  to  interpretation  of  the  P300  (e.o., 
Johnson  et  al.,  1R85).  It  mav  be  useful  therefore  to  reiterate  that  our 
theoretical  stance  does  not  assert  that  every  manipulation  known  to  affect 
"memory"  will  also  affect  P300.  Neither  are  we  suoqestinq  that  an 
enhancement  of  the  P300  should  accompany  each  improvement  in  the  subjects' 
ability  to  recall,  recognize,  or  otherwise  indicate  their  havinq  information 
in  memory.  Such  a  uni  versa! ist  position  would  not  make  much  sense  on 
oeneral  orounds,  and  it  is  clearly  inconsistent  with  the  demonstration  that 
P300  is  related  to  recall,  in  the  von  Restorff  paradiom,  only  if  rote 
rehearsal  strategies  are  used. 

The  label  "memory"  applies  to  a  larqe  variety  of  processes  whereby 
information  about  past  events  is  made  more  or  less  accessible  to  retrieval. 
It  is  doubtful  that  all  of  these  instances  are  the  result  of  the  application 
of  a  sinole  class  of  mechanisms.  Whether  an  event  will  be  recalled  or  not 
depends  on  a  larqe  number  of  independent  processes,  each  subject  to  a 
plethora  of  variables. 

Of  course,  speakino  of  memory,  one  is  speakinq  about  the  creation, 
storage,  maintenance  and  use  of  representations.  The  P300,  or,  more 
precisely,  the  process  that  the  P300  manifests,  can  make  contact  with  memory 
only  through  the  effect  that  its  invocation  may  have  on  the  establishment, 
manipulation,  or  access  to  these  representations.  The  specific  manner  with 
which  such  a  relationship  is  established  is  unknown  at  this  time.  The  data 
we  report  here  suoqest  that  the  invocation  of  the  P300  process,  and  the 
magnitude  with  which  it  is  invoked,  affects  in  some  manner  "episodic" 
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representations ,  that  is  those  representations  that  preserve  the 
character*  sties  of  the  stimuli  as  physical  entities.  We  infer  this  from  the 
fact  that  it  is  the  rote  memorizers  who  are  aided  by  the  conseauences  of  the 
P300,  and  from  the  fact  that  these  subjects  also  recall  the  physical  size 
with  which  the  word  was  presented.  The  specific  nature  of  the  way  the 
representations  are  "marked"  or  "activated"  remains  to  be  determined. 
Furthermore,  we  are  not  even  certain  that  the  invocation  of  the  P300  process 
is  the  direct  causal  effect  of  this  chanoe.  It  is  possible  that  the  markinq 
of  the  representation  and  the  elicitation  of  the  P300  are  both  the 
conseauences  of  yet  another  unspecified  process.  These  remain  issues  for 
further  research. 

It  may  he  useful  to  assert  also  a  methodoloaical  caveat.  We  suaqest 
that,  in  considerinq  relationships  between  P300  and  "memory,"  investiqators 
should  choose  paradiqms  in  which  they  narrow  the  ranae  of  variables  that  can 
affect  recall  to  variables  that  are  also  likely  to  affect  the  amplitude  of 
the  P300.  This,  for  example,  has  been  to  our  mind  the  benefit  of  the  use  of 
the  von  Restorff  paradiqm.  In  previous  work  (Karis,  Rashore,  Fabiani,  A 
Donchin,  1R84)  we  attempted  to  test  the  hypothesis  that  P300  amplitude  would 
predict  recall  by  usina  a  qeneral  recoqnition  paradiqm.  The  predicted 
relationship  between  P300  amplitude  and  recall  was  indeed  observed,  but  only 
weakly.  This  was  so,  we  believe,  because  the  amount  of  shared  variance 
between  the  processes  controllinq  P300  amplitude  and  those  controllinq 
"memory"  is  relatively  small  in  a  qlobal  recoqnition  experiment.  Ry 
introducinq  the  "isolates,"  Karis,  Fabiani,  and  Donchin  ( 1^84 )  could  study 
the  relationships  as  they  operate  in  the  context  of  stimuli  whose  recall  is 
laroely  controlled  by  the  isolation  and  this  very  isolation  is  known  to  have 
a  strona  effect  on  P300. 
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As  a  final  remark,  we  submit  that,  with  the  proDer  choice  of 
experimental  paradiqms,  P300  and  other  components  of  the  human  ERP  can  serve 
as  tools  in  the  study  of  coanitive  function.  The  utility  of  this  family  of 
tools  derives  from  their  special  ability  to  allow  direct  observation  of  the 
activity  of  intermediate  information  processinq  states.  In  the  main,  the 
standard  methodoloqy  of  Copnitive  Psychol ooy  depends  on  what  may  be  called 
"fi nal -outcome"  studies.  That  is,  what  is  available  for  observation  are 
specific  behavioral  acts,  such  as  button  presses  or  verbal  reports.  These 
acts  are  the  observable  conseauences  of  a  multiplicity  of  information 
processinq  activities  (not  necessarily  serial).  A  diversity  of  methods 
permits  inferences  reqardinq  the  fine  structure  of  the  human  information 
processinq  system  from  observations  of  these  final  outcomes  and  their 
distributions.  The  data  presented  here  illustrate  the  process  whereby  the 
study  of  human  information  processinq  can  be  auqmented  by  use  of 
observations  on  such  intermediate  outputs  as  the  P300.  This  component 
serves  as  an  index  for  activity  that  takes  place  durinq  the  encodinq  staqes, 
and  this  activity  definitely  affects  subseauent  recall,  as  a  function  of 
rehearsal  strateqies.  Thus,  these  data  can  serve  in  the  development  of 
models  of  memory. 
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Appendi  x 


Strategy  Instructions. 

General  Instructions .  This  time  we  want  you  to  use  particular 
strategies  to  memorize  the  words.  There  are  many  strategies  people  can  use 
to  remember  lists  of  words  in  experiments  like  this  one.  We  want  everyone 
to  use  the  same  strategies,  because  if  everyone  does  something  different  we 
often  get  confusing  data  and  don't  have  any  idea  why  the  results  turn  out 
the  way  they  do.  This  is  why  we  asked  you  what  strategies  you  used  during 
the  first  part.  So,  it's  very  important  that  you  try  to  follow  my 
instructions,  and  if  you  have  trouble  doing  this,  it's  important  that  you 
tell  me  when  I  ask  you  afterwards.  There  are  two  boxes  on  top  of  the  recall 
sheets,  one  wi  th  a  Y  and  one  with  a  N.  This  is  so  you  can  indicate  whether 
or  not  you  used  the  appropriate  strategies,  Y  for  yes,  N  for  no.  You  must 
choose  one.  Rick  the  one  that  best  represents  what  you  did.  If  you  used 
primarily  the  appropriate  strategy,  check  the  Y  box.  Otherwise,  check  the  N 
box.  I  want  you  to  remember  as  many  words  as  you  can,  of  course,  but  it's 
even  more  important  that  you  follow  the  strategies  I  describe.  If  this 
makes  it  harder  for  you  to  recall  the  words,  that's  okay,  because  the  most 
important  thinq  is  to  use  the  correct  strategy.  You'll  be  using  two 
different  strategies,  and  when  I  describe  the  second  you  may  want  to  keen  on 
using  the  first,  or  you  may  want  to  use  some  other  strategy  that  you  think 
is  better.  Please  don't!  Try  very  hard  to  do  what  I  suggest.  If  that's 
very  hard  I  want  you  to  tell  me,  because  that's  another  source  of 
information  that  may  he  helpful  to  us  when  we  analyze  the  data.  Honesty,  of 
course,  is  crucial,  and  it's  important  that  you  accurately  describe  what  vou 
do.  One  more  thing:  sometimes  you  may  recall  a  word  but  not  be  sure  whether 
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it  came  from  the  last  list,  or  from  some  earlier  list.  Or  you  may  recall  a 
word  from  a  previous  list  that  you  didn't  write  down  before.  In  these  cases 
you  should  write  down  the  words  on  the  sheet  you  are  usinq. 

Rote  Rehearsal  Strategy  Instructions.  I  want  you  to  repeat  the  words 
silently  to  yourself.  You  can  repeat  each  word  until  the  next  appears,  or 
you  can  repeat  both  the  word  presented  and  some  of  the  previous  words.  For 
example,  if  the  first  four  words  in  a  list  are  kid,  green ,  fit ,  and  bank , 
you  could  repeat  each  word  three  times  to  yourself  before  the  next  was 
presented  -  kid,  kid,  kid  ...  qreen,  qreen,  preen  ...  fit,  fit,  fit  .... 

Or,  if  you  wanted,  you  could  also  repeat  the  previous  words:  kid,  kid,  kid; 
areen,  kid,  qreen,  kid;  fit,  qreen,  kid,  fit,  qreen,  kid...  Of  course, 
after  several  words  have  been  presented,  you  will  not  be  able  to  repeat  them 
all  before  the  next  appears.  That's  OK,  of  course.  There  are  many  other 
ways  to  repeat  the  words.  For  example,  repeatinq  the  last  word,  alonq  with 
iust  one  of  the  previous  words:  kid,  kid,  kid  ...  areen,  qreen,  kid  ... 
fit,  fit,  areen  ...  bank,  bank,  fit  ....  You  don't  have  to  stick  to  one  of 
these  methods,  of  course.  You  may  want  to  combine  them,  or  chanoe  from  one 
to  another.  The  important  thinq  is  to  repeat  the  words,  in  whatever  way  you 
choose.  You  should  not  talk  out  loud,  however,  or  even  move  your  lips  and 
mouth,  because  it's  possible  that  such  movements  could  interfere  with  our 
recordinq. 

Elaboration  Rehearsal  Strategy  Instructions.  I  want  you  to  try  to 
connect  or  orqanize  the  words  in  some  way.  You  can  make  sentences  out  of 
them,  you  can  form  imaaes  or  pictures  with  them,  you  can  make  a  story  out  of 
them,  vou  can  assian  them  to  different  categories,  or  use  other  methods  that 
occur  to  you.  It  is  very  difficult,  of  course,  to  use  all  15  words 
together,  e.q.,  in  the  same  story,  but  you  don’t  have  to.  You  can  have 
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several  groups  of  a  few  words,  or  one  or  two  larger  aroups,  or  whatever 
seems  easiest.  You  may  not  be  able  to  use  every  word.  That’s  okay,  just 
try  to  do  the  best  you  can.  You  don’t  have  to  combine  words  that  were 
presented  toaether.  It’s  okay  to  combine  early  words  with  later  words,  or 
words  from  one  part  of  the  list  with  words  from  another.  Let’s  say  a  1  i st 
started  like  this:  kid,  qreen,  fit,  hank,  horse,  durinq,  final,  sun,  feed, 
mend,  took,  spend,  tend,  fez.  (Experimenter  holds  up  card  with  all  the  words 
on  it.)  Now,  I  didn't  pick  these  words  at  random.  I  chose  them  to 
illustrate  some  of  the  many  methods  that  can  be  used  to  remember  words.  The 
lists  you  will  see,  however,  are  composed  of  words  picked  at  random  from  a 
master  list. 

There  are  no  correct  or  incorrect  ways  to  do  this.  Whatever  helDS  is 
okay.  Here  are  some  examples.  You  miqht  create  a  sentence:  The  kid  rode 
the  oreen  horse  to  the  bank  durinq  the  sun- day  fair.  Or  you  could  imagine 
that  scene,  or  use  both  the  sentence  and  the  scene.  Combi ninq  this  many 
words  is  difficult.  You  could  .lust  imaqine  a  green  horse  or  a  kid  having  a 
fit,  or  a  kid  turninq  green  duri ng  a  fit,  and  so  on.  You  also  miqht  combine 
words  that  start  with  the  same  letter  or  letters:  fit,  final ,  feed,  fez,  or 
words  that  rhyme:  mend,  spend,  tend,  although,  if  possible,  it  is  better  to 
connect  or  oroanize  based  on  meaninq,  rather  than  sound  or  letters.  Some 
people  find  it  helps  to  form  imaoes  or  pictures  of  scenes  containing  the 
objects  that  must  be  remembered.  Many  words,  of  course,  are  not  easy  to 
visualize.  These  mioht  be  combined  with  other  words  that  are,  or  made  into 
a  sentence,  or  grouped  in  ways  that  don’t  require  visualization.  For 
example,  the  word  "quilty"  is  hard  to  visualize  by  itself.  However,  if  the 
words  ’’doo"  and  "steak"  were  also  presented  somewhere  in  the  list,  you  could 
think  of,  or  visualize,  a  doq,  looking  guilty  after  stealino  a  steak .  The 


P300  AND  MEMORY 


AA 


important  point  should  he  clear  -  if  it's  not,  tell  me.  I  want  you  to  do 
more  than  iust  repeat  the  words  to  yourself.  By  doi nq  more  I  mean  such 
thinos  as  oraanizina  or  combinina  the  words  as  I've  described,  or 
visualizino  them,  or  combinations  of  these  various  technioues,  or  other 
similar  technioues. 
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Footnotes 

1.  For  example,  with  two  eaui probable  events,  A  and  R,  the  P300  elicited  by 
the  last  Aina  seauence  will  be  smaller  in  the  sequence  AA  than  RA. 
Similarly,  in  third  order  series,  P300  amplitude  will  decrease  from  RRA  to 
ARA  to  RAA  to  AAA. 

?.  We  use  the  concept  of  workinq  memory  to  emphasizes  function  (see,  for 
example,  Raddeley,  19R1,  and  Raddeley  &  Hitch,  1974),  as  opposed  to 
short-term  memory,  which  has  often  been  used  to  refer  to  a  hypothetical 
structure. 

3.  This  component  was  labeled  "frontal-positive  slow  wave"  to  distinouish  it 
from  the  more  typical  slow  wave  reported  by  several  investiqators  (see 
Souires,  Squires,  A  Hillyard,  197R;  and  Ruchkin  A  Sutton,  1QR3).  In  fact, 
the  typical  slow  wave  is  neoative  frontally,  becomina  more  and  more  positive 
as  one  moves  back  across  the  scalp. 

4.  in  the  "oddball  oaradiqm"  the  subject  is  presented  with  a  sequence  of 
events  and  a  cateoori zation  rule  is  provided  that  classifies  each  event  into 
one  of  two  cateoori es.  The  events  are  presented  in  a  Bernoulli  sequence, 
and  the  subject  is  instructed  to  respond  to  each  event  in  some  manner  that 
requires  attention  to  the  cateoorization  rule.  In  qeneral ,  if  one  of  the 
cateqories  appears  more  rarely  than  the  other,  the  stinwli  belonqinq  to  that 
cateoory  will  elicit  a  P300,  whose  amplitude  will  be  inversely  prooortional 
to  probability.  However,  the  rarity  is  neither  necessary,  nor  sufficient, 
to  elicit  the  P300.  The  task  relevance  of  the  events  is  a  key  element  in 
determini nq  the  apoearence  of  P300. 
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5.  Roth  the  position  effect  and  the  strateay  by  memory  interaction  were 
replicated  with  the  entire  10-subject  sample,  even  thouoh  the  E  values  were 
sliohtly  different.  In  addition,  other  measurement  procedures  were  applied: 
Vector  Filter  (Gratton,  Coles,  A  Donchin,  1PR6)  was  applied  in  conjunction 
with  oeak-pickinq  and  cross-covari ance,  and  oeak-pickinq  measures  were  taken 
at  Pz,  for  both  the  10-  and  the  o-suhject  sample.  All  these  analyses 
yielded  similar  results. 

6.  Note  that,  by  shiftinq  the  waveforms  so  that  the  peaks  of  the  sinqle 
subjects'  P300s  were  aliened,  the  possibility  of  examininq  other  EPP 
components  (not  time-locked  to  P300)  was  lost. 
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Table  1 

Performance  ( p_)  and  von  Pestorff  Index  ( VRI )  (Percentages)  1  n  Session  1 
(Raseline  and  Training  Session). 


$# 

None 

P(a) 

VRI(b) 

Strateay  Instructions 

Rote 

P  VRI 

Elaborative 

P  VRI 

1 

68 

15 

40 

17 

64 

12 

2 

S3 

37 

6ft 

4 

65 

-8 

3 

46 

17 

39 

33 

56 

1 

4 

4ft 

-47 

2ft 

36 

46 

2 

5 

R1 

10 

45 

65 

65 

22 

fi 

42 

36 

32 

33 

60 

-13 

7 

43 

11 

37 

-ft 

41 

34 

ft 

4ft 

24 

44 

20 

63 

12 

Q 

4Q 

24 

38 

25 

63 

-15 

10 

SO 

-1 

47 

20 

59 

12 

M(  SO) 

49(5) 

13(24) 

42(11) 

23(17) 

57(9) 

6(16) 

(a)  P  (Performance) :  %  recalled  from  all  positions 

(b)  VRI  (von  Restorff  Index):  %  Isolates  recalled  minus  %  non-lsolates 
recalled  (position  6-10) 
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Table  2 

Performance  (P)  and  von  Restorff  Index  (VRI)  (Percentages) 
j n^  Session  and  3^  ( Experimental  Sessions,  Combi ned 
Values) . 


s# 

Strateqy 

Rote 

P(a)  VRI (b) 

Instructions 

Elaborative 

P  VRI 

1 

64 

31 

72 

4 

2 

66 

10 

69 

-1 

3 

40 

36 

49 

8 

4 

30 

16 

43 

-3 

8 

46 

28 

68 

1 

6 

38 

19 

36 

-7 

7 

38 

12 

40 

10 

8 

41 

16 

64 

23 

0 

38 

15 

64 

10 

10 

48 

22 

67 

-3 

M(SD) 

44(10) 

20(9) 

57(14) 

4(9) 

(a)  P  (Performance):  %  recalled  from  all  positions 

(b)  VRI  (von  Restorff  index):  %  isolates  recalled  minus  %  non-isolates 
recalled  (position  6-10) 


P300  AND  MEMORY 


48 


Table  3 

P300  Amplitudes  (Arbitrary  Units!  as  Measured  by  a 


Cross-Covariance  Procedure  at  Pz,  for  Isolates  and 

for  All 

the  Words. 

A.  ISOLATES 

Strateqy  Recalled 

Not-Recalled 

Rote  28.3 

18.7 

Elaborative  26.6 

24.8 

P.  ALL  THE  WORDS 

Strateoy 

Recalled 

Not-Recalled 

Rote 

16.0 

12.R 

Elaborative 


16.0 


16.4 
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Fiqure  Captions 

Fiqure  1.  Experimental  desian. 

Fiqure  2.  Averaoe  serial  position  curves  for  rote  strateoy  (2a)  and 
elaborative  strateqies  (2b).  Data  for  isolated  and  non-isolated  words  are 
plotted  separately.  The  isolates  are  represented  as  solid  horizontal  lines, 
indicatino  tbe  averaqe  percent  recall  of  the  isolates  from  position  6 
throuoh  10.  The  curves  for  the  non-isolates  are  represented  by  dashed  lines. 

Fiqure  3.  The  results  obtained  in  the  present  study  are  compared  with 
the  results  reported  by  Karis,  Fabiani,  and  Donchin  (1RR4).  The  averaoe  VRI 
is  plotted  aqainst  the  averaoe  recall  performance  for  each  strateqy 
condition.  The  circles  represent  rote  strateoies,  and  the  tri ancles 
elaborative  strateqies.  The  means  from  the  Karis,  Fabiani,  and  Donchin 
(10RA)  study  are  indicated  by  a  solid  line.  The  means  for  the  subjects  in 
the  present  study  are  indicated  by  a  dashed  line. 

Fioure  4.  The  ERP  waveforms,  averaped  over  all  subjects  at  three 
electrode  locations  (Fz,  Cz,  and  Pz)  for  isolates  (solid  line),  non-isolates 
cominq  from  isolated  series  (dashed  line)  and  from  control  series  (dotted 
line).  The  rote  strateoy  waveforms  are  plotted  in  the  left  panel,  the 
elaborative  strateqy  waveforms  in  the  riqht  panel. 

Fiaure  5.  ERP  waveforms  averaoed  over  all  subjects  at  Fz,  Cz  and  Pz  for 
isolates  recalled  (solid  line)  and  not  recalled  (dashed  line).  The  rote 
strateqy  waveforms  are  plotted  in  the  left  panel,  the  elaborative  strateoy 
waveforms  in  the  riqht  panel. 

Fiqure  fi.  ERP  waveforms,  averaoed  over  all  subjects,  at  Fz,  Cz  and  Pz 
for  all  subjects  and  positions,  latency  adjusted  on  the  basis  of  the  P300 
peak  for  both  strateoy  conditions.  Recalled  words  are  represented  by  a 
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solid  line,  not  recalled  words  by  a  dashed  line.  The  vertical  dashed  lines 
represent  the  limits  of  the  time  window  in  which  P300  amplitude  was 
assessed.  The  rote  strategy  waveforms  are  plotted  in  the  left  panel,  the 
elaborative  strateoy  waveforms  in  the  riqht  panel. 
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SUMMARY 

SUMMARY 

This  paper  describes  the  development  and  testing  of  a  "mental 
prosthesis"  that  provides  a  means  by  which  a  person  who  is  incapable  of 
producing  speech,  or  using  any  other  motor  system  (i.e..  Is  "locked-in") , 
can  communicate  through  a  computer  utilizing  the  P300  component  of  the 
event-related  brain  potential  (ERP).  The  26  letters  of  the  alphabet, 
together  with  several  other  symbols  and  commands,  are  displayed  on  a 
computer  screen  which  serves  as  the  keyboard  or  prosthetic  device.  In  the 
developmental  studies  reported  in  this  paper,  undergraduate  volunteers  took 
part  in  two  experimental  sessions.  In  the  first  session  (the  pilot 
study/training  session)  subjects  attempted  to  spell  a  word  and  convey  It  to 
a  voice  synthesizer  for  production.  In  the  second  session  (the  analysis  of 
the  operating  characteristics  of  the  system)  subjects  were  required  simply 
to  attend  to  individual  letters  of  a  word  for  a  specified  number  of  trials 
while  data  were  recorded  for  off-line  analysis.  The  purpose  of  the  first 
session  was  to  determine  the  feasibility  of  the  mental  prosthesis.  As 
subjects  viewed  the  computer  screen  and  attempted  to  spell  the  word, 
elements  of  the  matrix  were  flashed  In  a  random  sequence  and  the  particular 
flash  that  invoked  a  P300  was  detected.  Convergence  on  the  attended  element 
was  achieved  following  a  series  of  presentations  In  which  those  elements  of 
the  matrix  that  did  not  elicit  P300s  of  sufficient  amplitude  to  define  the 
critical  element  were  eliminated.  This  procedure  was  followed  until  the 
word  was  spelled,  and  the  command  to  route  the  word  to  a  voice  synthesizer 
was  Identified.  Each  subject  completed  this  task  successfully.  Our  aim  In 
the  second  session  was  to  assess  the  efficiency  of  different  methods  for 
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Identifying  the  P300  to  the  attended  letter.  That  is,  in  the  second  test 
session  we  assessed  the  operating  characteristics  of  the  system  as  a  means 
of  Information  transfer  by  analyzing  the  relative  efficacy  of  various 
algorithms  for  detecting  P300.  Our  results  in  both  test  sessions  suggest 
that  the  Mental  Prosthesis  may  prove  to  be  a  useful  contnunlcation  device  for 
patients  who  have  no  other  means  of  expression. 
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INTRODUCTION 

Several  hundred  studies  have  demonstrated  that  when  subjects  are 
presented  with  a  series  of  stimulus  events  that  can  be  classified  into  2 
categories,  one  of  which  Is  rare,  task-relevant,  or  both,  these 
rare/relevant  events  elicit  an  event-related  brain  potential  (ERP)  with  an 
enhanced  positive-going  component  with  a  latency  of  about  300  msec,  labelled 
the  P300.  (for  reviews  see  Pritchard,  1981;  Hillyard  and  Kutas,  1983;  and 
Donchin  et  al.,  1986.)  The  amplitude  of  this  component  Is  largest  to  the 
task-relevant  or  target  event  and  varies  Inversely  with  the  probability  of 
occurrence  of  that  event;  that  is,  the  less  frequent  the  target,  the  larger 
the  P300  amplitude.  The  elicitation  of  P300  depends  critically,  therefore, 
on  the  subject's  ability  to  discriminate  the  events  and  assign  them  to  the 
appropriate  categories.  This  experimental  arrangement  has  come  to  be  called 
the  "oddball"  paradigm,  as  the  appearance  of  P300  can  be  used  as  an 
indication  that  the  subject  has  recognized  the  occurrence  of  an  event 
belonging  to  the  rare  category. 

Note  that  it  Is  not  necessary  for  the  subject  to  report  the  occurrence 
of  the  a  target  event  through  any  overt  means  (e.g.,  button  press).  Indeed, 
in  the  classic  oddball  task,  the  subject  is  often  required  only  to  maintain 
a  running  mental  count  of  number  of  occurrences  of  the  target.  Thus,  the 
appearance  of  the  P300  signals  the  subject's  recognition  of  the  rare, 
task-relevant  events  without  recourse  to  verbal  or  motor  means  of 
communication.  This  attribute  of  the  P300  suggests  that  it  may  be  possible 
to  develop  a  "Mental  Prosthesis"  utilizing  the  oddball  paradigm  that  would 
permit  persons  who,  as  a  result  of  neurological  Injury  or  disease,  are 
Incapable  of  any  overt  means  of  communication.  In  this  paper  we  describe 
such  a  Mental  Prosthesis  and  provide  a  detailed  evaluation  of  the  utility  of 


Mental  Prosthesis 


6 


the  P300  as  a  communication  channel. 

Prostheses  are  generally  used  to  execute  the  function  of  a  damaged  or 
dysfunctional  motor  system  by  employing  a  substitute  motor  system.  In 
recent  years,  refinements  in  prosthetic  devices  have  been  made  to  aid  in 
communication  by  employing  various  motor  systems.  For  example,  quadriplegic 
patients  with  good  control  of  the  neck  muscles  can  activate  buttons  with  a 
rod  attached  to  the  forehead.  Increasingly  sophisticated  prostheses  have 
been  developed  to  allow  Individuals  with  more  severe  disabilities  to 
communicate  through  the  extension  of  remaining  motor  functions.  Recent 
developments  Include  a  typewriter  that  can  be  operated  by  means  of  a  light 
beam  directed  by  head  movements,  (Soede  et  al.  1974);  a  typewriter  system 
controlled  by  a  dental  palate  key  operated  with  Morse  code  (Saarnlo,  1974); 
a  typewriter  operated  by  EMG  signals  (Torok,  1974);  a  switch  that  can  be 
operated  by  slight  hand  or  toe  movements  (Hammond,  1974);  a  variety  of 
switches  that  can  be  Interfaced  with  communication  boards,  such  as  a  switch 
that  can  be  operated  by  Interrupting  a  beam  of  light  with  any  part  of  the 
body  that  can  be  moved  (e.g.,  the  tongue)  (Hardiman  et  al.,  1979);  several 
efficient  systems  based  on  connecting  a  joystick  with  a  keyboard  (Shwedyk 
and  Gordon,  1977;  Vasa  and  Lywood,  1975);  and  a  portable  system  capable  of 
producing  a  liquid  crystal  display  of  one  of  a  number  of  pre-programmed 
messages  that  can  be  attached  to  an  electric  wheelchair  and  controlled  by 
the  motion  of  a  joystick  (Jardlne  et  al.,  1983). 

It  Is  not  unusual  for  the  the  oculomotor  system  to  remain  functional 
when  other  voluntary  motor  systems  are  damaged  quite  severely.  Numerous 
devices  have  been  developed  to  take  advantage  of  this  by  detecting  eye 
position  as  a  means  of  communication.  A  variety  of  means  have  been  utilized 
to  detect  eye  posttlon.  Including  reflection  via  mirrors  and  refraction  via 
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prisms  (Kate  et  al.  1984),  focusing  an  image  of  the  eye  and  eyelids  on  a  set 
of  photocells  (Warden,  1977),  measuring  corneal  reflection  of  Infrared 
light  (Rinard  and  Rugg,  1976;  Rubin  and  Stark,  1984),  and  visual  evoked 
potentials  (Sutter,  1983). 

All  of  the  systems  described  above  substitute  one  motor  system  for 
another.  In  some  patients,  however,  no  functional  voluntary  motor  systems 
remain  sufficiently  Intact  to  permit  their  utilization  for  communication, 
even  though  the  patients  retain  sensory  and  cognitive  abilities,  for  such 
individuals,  the  only  option  available  for  communication  would  be  a  "mental 
prosthesis"  that  utilizes  non-motor  manifestations  of  mental  activity  for 
comnunlcatlon.  The  system  we  describe  here  allows  the  subject  to  push  a 
metaphorical  switch  by  focusing  attention  on  one  of  a  series  of  stimulus 
events.  The  discrimination  between  the  event  on  which  the  subject  Is 
focusing  and  the  other  events  In  the  series  carries  the  information  that  the 
subject  is  communicating.  By  detecting  which  of  the  events  in  the  series 
generates  a  P300,  the  appropriate  computer- implemented  algorithm  can 
Identify  the  message  the  subject  Is  trying  to  communicate  and  send  it  for 
him  or  her  (Farwell  et  al.,  1986;  Donchln,  1987). 

The  system  works  as  follows:  a  6-by-6  matrix  containing  the  letters  of 
the  alphabet  and  a  few  one-word  commands  (see  Figure  1)  Is  displayed  on  a 
computer-controlled  CRT  screen.  The  "stimulus  events"  that  occur  In  the 
test  consist  of  Intensifications  of  either  a  row  or  a  column  of  the  matrix. 
The  subject  Is  attempting,  at  any  Instant,  to  communicate  the  contents  of 
one  cell  In  the  matrix.  As  the  subject  focuses  attention  on  that  cell,  the 
column  and  the  row  containing  the  cell  become  "relevant"  events.  There  are 
twelve  possible  events  (6  rows  and  6  columns)  only  2  of  which  are  relevant. 
These  events  are  therefore  both  task-relevant  and  rare.  Thus,  any  flash 
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that  contains  the  cell  on  which  the  subject  Is  focusing  should  elicit  a 
P300.  The  amplitude  of  the  P300  following  each  flash  Is  assessed,  and  the 
attended  cell  Is  Identified  as  the  cell  at  the  Intersection  of  the  row  and 
column  that  elicit  the  largest  P300s. 

We  report  here  a  study  In  which  4  healthy  volunteers  used  the  system  to 
comnunlcate  a  5-letter  word  to  a  computer.  The  primary  purpose  in  this 
study  Is  to  determine  the  number  of  trials  and  the  rate  of  event 
presentation  that  are  required  to  achieve  a  specified  level  of  accuracy  in 
communication.  In  a  clinical  situation,  of  course,  each  choice 
distinguished  by  the  system  could  be  used  to  communicate  an  entire  word, 
phrase,  or  sentence,  rather  than  simply  an  Individual  letter. 

METHOOS 

Subjects 

Four  healthy  subjects,  3  females  and  one  male,  whose  ages  ranged  from 
20  to  36  years,  participated  In  the  study. 

Data  Acquisition  and  Analysis 

The  electroencephalogram  (EEG)  was  recorded  from  Ag-AgCl  Beckman 
Biopotential  electrodes  placed  at  the  Pz  (parietal)  site  (10-20 
International  system),  referred  to  linked  mastolds.  This  electrode  site  was 
chosen  because  It  Is  where  the  largest  amplitude  P300  Is  recorded  In  young 
adults  (Pritchard,  1981;  Kutas  and  Hlllyard,  1983;  Dcnchln  et  al.,  1986). 
Electro-oculogram  (EOG)  was  recorded  from  sub-  and  supraorbital  electrodes 
(above  and  below  the  right  eye).  The  subjects  were  grounded  at  the 
forehead.  Electrode  Impedance  did  not  exceed  5  kllohm.  Brain  electrical 
activity  was  amplified  by  Grass  model  12  amplifiers  with  low-  and  high-pass 
filters  set  at  half-amplitude  frequencies  of  35  and  0.01  Hz,  respectively. 
These  signals  were  digitized  at  a  rate  of  50  samp1es  per  second.  Data  were 
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analyzed  in  real  time  in  the  pilot/training  session  and  off-line  in  the 
assessment  session,  both  of  which  are  described  in  detail  below. 

Pi lot/Traininq  Session 

The  purpose  of  this  session  was  twofold:  1)  to  determine  if  it  is 
possible  to  utilize  the  P300  as  a  communication  channel  that  would  have 
sufficient  speed  and  accuracy  to  be  of  practical  use,  by  employing  a 
real-time  data  analysis  procedure;  and  2)  to  familiarize  the  subjects  with 
the  apparatus  and  procedures  of  the  Mental  Prosthesis  and  give  them  some 
practice  in  utilizing  the  system. 


Insert  Figure  1  About  Here 


Subjects  were  presented  with  a  6-by-6  matrix  whose  cells  contained  the 
letters  of  the  alphabet  as  well  as  several  one-word  commands  for  controlling 
the  system  (Figure  I).  The  matrices  were  displayed  on  a  computer-controlled 
CRT.  In  each  "trial,"  each  of  the  the  6  rows  of  the  matrix,  or  each  of  the 
6  columns,  were  intensified  for  a  period  of  100  msec.,  with  an  Interval  of 
500  msec  from  the  beginning  of  the  intensification  of  each  row/column  and  of 
the  subsequent  row/column  to  be  intensified.  The  rows  were  selected  for 
Intensification  in  a  random  order,  and  then  the  columns  were  Intensified  in 
a  similar  manner. 

Subjects  were  Instructed  to  attend  to  a  given  letter  and  to  keep  a 
running  mental  count  of  the  the  number  of  times  It  flashed.  They  completed 
5  blocks  of  120  trials  each  under  these  conditions.  The  fifth  block  of 
trials  for  each  subject  in  the  pilot/training  session  provided  an  average 
waveform  for  the  P300  to  a  rare,  task-relevant  event  that  was  used  as  a 
template  in  subsequent  the  signal-detection  block  described  below. 
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In  the  sixth  block  of  trials,  we  used  a  real-time  signal-detection 
algorithm  to  allow  subjects  to  communicate  using  the  Mental  Prosthesis. 

Their  task  was  to  have  the  computer  generate  the  word  "BRAIN."  We  employed 
a  covariance  signal-detection  algorithm  that  is  described  in  detail  below, 
as  Is  the  stimulus  presentation  procedure  we  used. 

Trials  with  muscle  or  EOG  artifact  were  eliminated.  Values  for  E06 
artifact  rejection  were  determined  on  the  basis  of  peak-to-peak  EOG  response 
amplitude,  and  of  muscle  artifact  rejection  on  the  basis  of  mean  absolute 
deviation  of  the  Pz  channel. 

Subjects  selected  each  of  the  letters  in  the  word  "BRAIN"  In  turn,  and 
silently  counted  the  flashes  of  the  row  or  column  containing  the  letter 
until  the  system  displayed  the  letter  It  had  selected  in  a  specified 
position  on  the  screen  (see  Figure  1).  After  the  letters  spelling  the  word 
“BRAIN"  had  been  displayed,  the  subject  selected  the  "TALK"  command,  and  the 
word  was  sounded  by  means  of  a  Votrax  speech  synthesizer.  In  a  few  cases, 
an  incorrect  letter  selection  was  made,  and  the  subject  used  the  BKSP 
(backspace)  command  to  correct  the  error. 

The  system  recognized  the  selections  on  the  basis  of  a  comparison  of 
the  covariances  of  the  ERPs  following  each  flash  with  the  template  described 
above.  (The  covariance  signal-detection  algorithm  is  described  In  more 
detail  below.)  All  of  the  subjects  completed  the  task  successfully.  That 
is,  they  were  able  to  use  the  Mental  Prosthesis  to  spell  the  owrd  and  to 
transmit  It  to  the  voice  synthesizer  for  production. 

In  the  pllot/tralnlng  session,  our  concern  was  to  determine  whether  the 
system  could  be  employed  by  the  subjects  and  to  give  them  some  training  on 
the  system.  Consequently,  ERP  data  were  analyzed  In  real  time,  but  were 
not  recorded  on  any  storage  medium.  Information  on  the  time  required  for 
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each  selection  was  not  retained. 

Analysis  of  the  Operating  Characteristics  of  the  System 

The  pilot/training  session  demonstrated  that  the  P300  elicited  in  these 
circumstances  can  be  used  as  a  switch  subjects  can  employ  to  communicate  a 
choice  of  one  out  of  36  Items.  The  process  depends,  of  course,  on  the 
presentation  of  many  stimuli  and  on  the  assessment  of  the  P300  following 
each  flash.  The  effectiveness  of  this  procedure  depends  on  the  degree  to 
which  the  message  can  be  communicated  with  a  small  number  of  trials  using  an 
efficient,  cost-effective,  on-line  detector  of  the  P300.  We  recalled  the 
same  4  subjects  and  ran  10  additional  blocks  to  assess  the  the  relationship 
between  the  accuracy  with  which  the  cell  selected  by  the  subject  was 
identified  and  the  number  of  trials  used  in  the  detection.  Four  different 
methods  for  detecting  the  P300  were  used.  The  data  were  recorded  on  disk 
for  off-line  analysis. 

In  an  attempt  to  Improve  the  efficiency  of  the  system,  we  compared  2 
different  intervals  between  row/column  flashes.  Half  of  the  blocks  were  run 
with  a  125  msec  delay  between  the  onset  of  the  intensification  of  a  given 
row  or  column  and  the  onset  of  the  intensification  of  the  next  row  or  column 
to  be  flashed  (Inter-stimulus  Interval  or  ISI),  and  half  with  a  500  msec 
IS  I .  As  before,  each  row  was  Intensified  for  100  msec,  and  then  the  each  of 
the  columns  was  Intensified  similarly.  Figure  2  Illustrates  the  time  course 
of  events  In  the  blocks  using  the  ISI  of  125  msec. 


Insert  Figure  2  About  Here 


The  relevant  data  consisted  of  the  EEG  digitized  for  600  msec  after  the 
onset  of  each  flash.  The  EEG  was  digitized  continuously  from  20  msec  prior 
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to  the  first  flash  in  each  trial  to  600  msec  after  the  sixth  flash.  The 
subsequent  trial  began  approximately  620  msec  after  the  sixth  flash.  The 
inter-trial  Intervals  (ITI)  measured  from  the  beginning  of  one  trial  to  the 
beginning  of  the  next  trial,  then,  were  1245  and  3120  msec  for  the  125  and 
500  msec  I SI s  respectively.  Note  the  distinction  between  Inter-stimulus 
interval  ( IS  I ) — the  time  from  the  onset  of  the  flash  of  one  row  or  column  to 
the  onset  of  the  flash  of  the  next  row  or  column— and  inter-trial  interval 
(ITI)— the  time  from  the  onset  of  one  trial  (6  row  or  column  flashes)  to  the 
onset  of  the  next  trial. 

Each  block  consisted  of  30  trials.  Five  blocks  at  125  msec  ISI  were 
followed  by  5  blocks  at  500  msec  ISI  for  2  of  the  subjects,  and  the  order 
was  reversed  for  the  other  2  subjects. 

Subjects  were  instructed  to  keep  a  running  mental  count  of  the  flashes 
of  the  letter  "B"  until  the  “CHOOSE  ONE  LETTER  OR  COMMAND"  instruction  (see 
Figure  1)  was  turned  off  for  500  msec  and  then  turned  back  on.  They  were 
then  to  count  the  letter  "R"  until  the  same  signal  appeared,  then  the  letter 
"A",  then  "I",  and  then  "N“.  Trials  with  muscle  or  EOG  artifact  were 
eliminated  In  the  same  manner  as  In  the  pllot/trainlng  session.  After  each 
30  uncontaminated  trials  were  accumulated,  the  "CHOOSE..."  instruction 
turned  off  for  500  msec,  and  then  a  new  block  began  with  the  next  letter  to 
be  attended.  After  5  blocks  at  one  ISI,  the  subjects  received  a  short 
break,  and  then  the  next  series  was  run. 

Thus,  In  effect,  the  subject  spelled  the  word  "BRAIN"  in  each  series  of 
5  blocks,  with  approximately  30  trials  for  each  letter.  (When  a  trial  was 
rejected  for  artifact.  It  was  not  recorded  for  Inclusion  in  the 
signal -detect Ion  computations,  and  an  additional  trial  was  presented. 
Therefore,  the  total  number  of  trials  in  each  block  viewed  by  the  subject 
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was  sometimes  a  few  more  than  30,  but  the  number  of  trials  recorded  was  30 
in  every  case.)  Note  that  each  trial  contained  6  distinct  events,  namely 
the  flashes  of  each  of  the  rows  or  columns,  only  one  of  which  was 
task-relevant. 

Oata  Analysis 

In  analyzing  the  data,  we  sought  to  determine  how  many  trials  were 
required  to  detect  the  letter  on  which  the  subject  was  focusing  at  different 
levels  of  accuracy,  for  each  of  4  different  detection  methods. 

For  analytic  purposes,  each  trial  was  divided  Into  6  data  windows  or 
subtrials,  each  consisting  of  the  data  for  600  msec  after  onset  of  the  flash 
of  a  row  or  column  (see  Figure  2).  Thus,  since  the  ISI  (i.e.,  the  time 
between  flashes)  was  less  than  600  msec,  these  subtrials  contained 
overlapping  data.  For  each  subtrial  we  computed  a  score  that  measured  the 
magnitude  of  the  P300  in  the  epoch  following  the  presentation  of  the  row  or 
the  column. 

Four  different  algorithms  were  used  to  compute  the  scores:  (a)  stepwise 
linear  discriminant  analysis  (SWOA),  (b)  peak  picking,  (c)  area,  and  (d) 
covariance.  We  will  briefly  describe  each  of  these  algorithms.  The 
Interested  reader  can  find  detailed  discussions  of  these  procedures  in 
Donchln  and  Heffley  (1975)  and  Coles  et  al.  (1986). 

A.  Stepwise  discriminant  analysis.  SWDA  Is  a  classification  procedure. 
In  the  present  case,  a  score  was  computed  that  reflects  the  "d1stance“ 
between  each  epoch  and  the  mean  of  a  group  of  trials  known  to  Include  a 
P300,  as  well  as  the  distance  from  the  mean  of  a  group  that  does  not  include 
a  P300.  This  measurement  was  performed  by  applying  a  discriminant  function 
to  the  data  from  the  epoch.  That  function  was  developed  on  the  basis  of  a 
"training  set"  of  trials  whose  group  membership  was  known.  The  ERPs  we 
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recorded  while  the  subject  was  focusing  on  the  first  2  letters  ("B"  and  "R") 
served  as  the  training  set  for  our  analysis.  The  remaining  ERPs  provided 
the  "analysis  set."  We  used  the  training  set  data  to  compute  discriminant 
weights  that  distinguished  between  the  attended  subtrials  (600  msec 
following  the  flash  of  a  row  or  column  containing  the  attended  cell)  and  the 
unattended  subtrials  (600  msec  following  the  flash  of  a  row  or  column  not 
containing  the  attended  cell).  These  weights  were  applied  to  Individual 
subtrials  in  the  analysis  set  and  sunned  across  trials  in  order  to  identify 
the  attended  cell  of  the  matrix. 

B.  Peak  picking.  The  amplitude  of  P300  was  defined  as  the  difference 
between  the  lowest  negative  point  prior  to  the  P300  window  (defined  as  the 
time  range  within  which  the  average  attended  waveform  In  the  training  set 
for  each  subject  was  positive)  and  the  highest  positive  point  in  the  P300 
window.  The  window  for  the  P300  ranged  typically  between  220  and  500  msec. 

C.  Area.  The  "area"  of  P300  was  calculated  as  the  sum  of  the  data 
points  in  the  P300  window  (as  defined  above). 

D.  Covariance.  A  P300  template  was  computed  as  the  average  of  the 
attended  subtrials  In  the  training  set  for  each  subject.  P300  scores  in  the 
analysis  set  were  derived  by  computing  the  covariance  of  each  subtrial  with 
this  template.  The  covariance  was  computed  using  all  of  the  points  In  the 
600-msec  epoch. 

The  values  attained  from  the  above  analyses  were  then  used  to  determine 
the  letter  upon  which  the  subject  was  focusing  attention.  Row  and  column 
scores  given  by  the  respective  algorithms  were  summed  to  compute  a  unique 
score  for  each  cell  In  each  pair  of  trials  (one  trial  In  which  rows  were 
flashed  and  one  trial  In  which  columns  were  flashed).  For  example,  the 
score  for  “B,"  which  Is  located  in  the  first  column  and  the  second  row  (see 
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Figure  1),  was  the  sum  of  the  score  for  the  first  column  and  the  score  for 
the  second  row.  (By  "first"  and  "second"  here  we  refer  to  the  spatial 
position  in  the  matrix,  and  not  the  temporal  position  in  the  sequence  of  row 
or  column  flashes.  Since  flashes  were  in  random  order,  the  "first"  column 
would  be  flashed  first  only  approximately  one-sixth  of  the  time.) 

The  scores  computed  for  each  letter  were  summed  across  trials  to 
determine  which  cell  was  identified  as  the  cell  selected  by  the  subject. 

Each  test  could  yield  one  correct  response  or  one  of  35  possible  errors. 

The  test  was  considered  a  “hit"  if  the  algorithm  yielded  the  largest  total 
score,  summed  across  trials,  for  the  letter  on  which  the  subject  was 
focusing.  For  example,  If  the  subject  was  attending  to  the  letter  "B"  and  6 
trials  were  being  considered  In  the  analysis,  a  correct  response  would  be 
achieved  if  the  total  of  the  6  "B"  scores--the  scores  for  the  rows  and 
columns  containing  "B"--wa$  greater  than  the  total  of  the  6  scores  for  any 
other  cell  in  the  matrix. 

RESULTS 

The  principal  aim  In  this  phase  of  the  study  was  to  derermine  the  speed 
with  which  the  letter  on  which  the  subject  Is  focusing  can  be  determined, 
given  the  detection  technique  employed  for  analyzing  the  trials.  A  rather 
distinct  P300  is  elicited  by  the  correct  letter,  as  can  be  seen  in  Figure  3. 
This  figure  presents  ERP  responses  to  Intensifications  of  attended  or 
correct  letters  and  of  unattended  letters,  averaged  across  all  trials  for 
each  subject. 
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Insert  Figure  3  About  Here 


If  the  technique  required  the  presentation  of  30  trials  for  correct 
detection,  however.  It  would  be  quite  11m1ted--the  rate  of  Information 
transmission  would  be  slower  than  one  character  per  one  and  one-half  minutes 
at  an  ISI  of  500  msec.  We  hypothesized  that  the  transmission  rate  could  be 
speeded,  however,  by  shortening  the  ISI  and  overlapping  the  data-col lection 
epochs.  We  knew  from  previous  research  that  the  rate  could  be  Increased  by 
taking  advantage  of  the  fact  that  the  P300  can  be  detected  using  a 
substantially  smaller  number  of  trials  (Squires  and  Donchln,  1976).  We 
examined,  therefore,  the  accuracy  of  detection  of  the  attended  letter  as  a 
function  of  the  number  of  trials  at  each  ISI  for  each  of  the  4  detection 
algorithms.  Oetectlon  accuracy  was  estimated  by  means  of  an  iterative 
sampling  technique  akin  to  bootstrapping  (Efron,  1979).  Bootstrapping 
provides  an  estimate  of  a  parameter  In  the  absence  of  adequate  data  on  Its 
sampling  distribution  by  obtaining  many  random  sub-samples  from  the 
available  data  and  computing  the  parameter  afresh  for  each  of  these 
sub-samples.  The  distribution  of  these  values  approximates  the  actual 
distribution. 

We  randomly  chose  1000  sets  of  2  trials,  1000  sets  of  4  trials,  and  so 
on  up  to  1000  sets  of  40  trials  from  the  analysis  set.  (Recall  that  the 
analysis  set  consisted  of  30  trials  for  each  of  3  letters,  a  total  of  90 
trials.)  The  sampling  was  with  replacement.  We  applied  the  4 
signal -detection  algorithms,  computed  scores  for  each  of  the  36  stimuli,  and 
determined  how  many  times  out  of  1000  the  stimulus  that  the  subject  was 
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attending  had  the  highest  score  with  each  algorithm  at  each  number  of  trials 
considered  in  the  analysis.  This  provided  an  estimate  of  the  percent  of 
correct  identifications  of  the  chosen  stimulus  out  of  the  36  presented,  as  a 
function  of  the  number  of  trials  considered  in  the  analysis.  By  multiplying 
by  the  inter-trial  Interval,  we  obtained  an  estimate  of  the  accuracy  of  each 
algorithm  as  a  function  of  time. 


Insert  Figure  4  About  Here 


Figure  4  Illustrates  the  Iterative-sampling  analysis  procedure.  This 
figure  presents  the  results  of  only  one  Iteration,  in  each  of  2  specific 
cases.  For  Illustrative  purposes,  we  have  included  one  set  of  data  where 
the  discrimination  between  the  attended  and  unattended  stimuli  Is  quite 
clear,  and  another  set  where  the  discrimination  is  much  less  clear.  In 
these  figures  we  plot  the  scores  assigned  to  each  of  the  36  letters  by  the 
application  of  one  detection  technique:  a)  SWOA  scores  for  subject  3  in  the 
500  msec  condition,  and  b)  covariance  scores  for  subject  2  In  the  500  msec 
ISI  condition.  The  scores  we  plotted  for  each  of  the  sample  sizes  assigned 
to  the  correct  character  can  be  distinguished  from  scores  assigned  to 
characters  sharing  either  a  row  or  a  column  with  the  correct  character  as 
well  as  from  entirely  Incorrect  characters. 

Thus,  the  36  scores  at  each  sample  size  fall  Into  3  groups:  1)  one 
"attended"  letter  (solid  line);  2)  10  unattended  letters  that  share  a  row  or 
a  column  with  the  attended  letter--and  therefore  are  flashed  at  the  same 
time  as  the  attended  letter  either  when  flashed  with  their  row  or  with  their 
column  (dotted  and  chained  lines);  and  3)  25  other  unattended  letters--wh1ch 
never  are  flashed  at  the  same  time  as  the  attended  letter  (dashed  lines). 
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Note  that  the  highest  score  was  generally  obtained  for  the  attended 
cell;  and,  as  might  be  predicted,  the  scores  for  unattended  cells  that  were 
In  the  same  column  or  the  same  row  with  the  attended  cell  were  higher,  in 
general,  than  the  scores  for  the  other  unattended  cells.  The  decision 
algorithm  is,  of  course,  "correct"  whenever  the  attended  letter  is  assigned 
the  highest  score.  It  can  be  seen  that  this  Is  generally  the  case.  It  can 
also  be  seen  that  there  Is  considerable  individual  variance.  It  Is  evident 
In  Figure  4  that  as  the  number  of  trials  increases  the  expected  reduction  In 
signal-to-nolse  ratio  yields  greater  accuracy:  the  larger  the  number  of 
trials,  the  more  often  the  correct  letter  is  assigned  the  highest  score,  and 
the  larger,  In  general,  the  difference  between  the  score  assigned  to  the 
correct  letter  and  the  scores  assigned  to  the  incorrect  letters. 

For  each  of  the  4  subjects  we  obtained  1000  data  sets  similar  to  those 
illustrated  in  Figure  4,  and  analyzed  each  data  set  with  each  of  4  detection 
algorithms.  It  is  the  total  data  set  (including  all  1000  of  these  smaller 
data  sets)  that  was  analyzed  as  we  assessed  the  accuracy  of  this 
comnunicatlon  channel  as  a  function  of  Its  speed. 

As  we  noted,  the  analysis  procedure  was  repeated  1000  times  for  each 
combination  of  sample  size  (2-40  trials)  and  algorithm  (SWDA,  peak  picking, 
area,  and  covariance),  at  each  ISI  (125  and  500  msec).  In  each  of  the  1000 
iterations,  for  each  sample  size,  we  picked  a  new  random  sample  of  the 
trials  for  Inclusion  in  the  analysis.  For  a  particular  iteration,  at  any 
given  sample  size,  each  analysis  procedure  provided  either  a  correct  or  an 
Incorrect  determination.  We  tallied  the  number  of  correct  determinations 
for  each  sample  size,  for  each  subject,  at  each  ISI,  employing  each  analysis 
algorithm. 
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Insert  Figure  5  About  Here 


In  Figure  5  we  plot  the  proportion  of  correct  decisions  out  of  of  the 
1000  iterations  of  the  procedure  at  each  sample  size.  To  allow  comparison 
of  the  data  obtained  with  the  2  IS  Is  and  to  facilitate  the  evaluation  of  the 
speed  of  the  system,  the  percent  of  correct  Identifications  of  the  attended 
letter  Is  plotted  against  the  time  required  to  present  a  given  number  of 
trials,  rather  than  against  the  number  of  trials.  The  Interval  therefore  Is 
equal  to  the  number  of  trials  times  the  Inter-trial  Interval.  As  can  be 
seen  in  Figure  5,  there  are  considerable  Individual  differences  In  the 
subjects'  ability  to  use  the  system,  as  well  as  In  the  relative 
effectiveness  of  the  different  detection  algorithms.  Moreover,  different 
algorithms  were  more  effective  for  different  subjects.  All  of  the  subjects, 
however,  were  able  to  achieve  a  high  level  of  accuracy  in  comnuni eating 
their  choices  to  the  system  at  a  speed  of  some  seconds  per  choice. 
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Insert  Table  I  about  here 


Table  I  presents  speed  and  accuracy  figures  for  the  fastest  algorithm 
for  each  subject  at  each  ISI.  When  the  subjects'  optimal  IS  I  and 
signal-detection  algorithm  were  used,  the  mean  time  required  to  achieve  80% 
accuracy  of  determination  of  the  one  stimulus  out  of  36  that  the  subject  was 
attending  was  20.9  seconds  (Table  la).  For  95%  accuracy,  the  mean  time 
required  was  26.0  seconds  (Table  lb).  A  choice  of  one  out  of  36  contains 
5.2  bits  of  Information,  so  the  speed  at  95%  accuracy  was  0.20  bits  per 
second,  or  12.0  bits  per  minute.  By  using  the  "BKSP"  (backspace)  command 
(see  Figure  1)  with  the  same  speed  and  accuracy,  a  subject  could  correct 
errors  and  achieve  over  99.9%  accuracy  with  a  speed  of  0.18  bits  per  second, 
or  10.8  bits  per  minute. 

SWDA  and  peak  picking  proved  to  be  the  most  efficient  algorithms.  At 
125  msec  ISI,  SWDA  was  the  fastest  algorithm  to  reach  both  80%  and  95% 
accuracy  in  3  out  of  4  cases.  At  500  msec  ISI,  peak  picking  was  fastest  to 
reach  both  80%  and  95%  accuracy  In  3  out  of  4  cases.  (A  possible  explanation 
of  this  difference  In  algorithm  effectiveness  as  a  function  of  ISI  is 
discussed  below.)  When  considering  the  4  subjects,  2  IS  Is ,  and  2  accuracy 
criteria  (80%  and  95%),  SWDA  yielded  the  fastest  times  to  reach  the  accuracy 
criterion  In  8  cases  out  of  16,  and  peak  picking  In  6  cases.  Area  and 
covariance  were  each  fastest  In  one  case. 


Insert  Table  II  about  here 


Table  II  shows  the  times  taken  by  each  of  the  4  algorithms  to  reach  80% 
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and  95%  accuracy,  for  each  subject  at  each  I  SI .  As  shown  In  Table  II  and 
Figure  5,  different  signal-detection  algorithms  were  more  effective  for 
different  subjects.  This  Is  a  result  of  differences  In  the  characteristic 
ERPs  for  different  subjects  and  differences  in  the  information  utilized  by 
the  algorithms. 

Discussion 

This  study  addressed  two  distinct  questions.  We  sought  to  determine 
if  it  Is  Indeed  the  case  that  the  P300  can  be  employed  as  a  switch  by  means 
of  which  the  subject  can  toggle  a  choice.  This  question  Is  clearly  answered 
In  the  affirmative.  Indeed,  the  specific  arrangement  we  used  to  present 
choices  to  the  subject  amplifies  the  power  of  the  the  P300  to  act  as  a 
binary  switch,  as  the  series  of  choices  allows  for  the  reliable 
Identification  of  one  choice  among  36  distinct  objects.  In  principle,  this 
method  can  be  used  In  a  manner  that  would  allow  for  a  choice  among  more 
Items,  as  the  number  of  rows  and  columns  can  be  Increased.  However,  such  an 
Increase  would  entail  a  cost  In  that  the  total  number  of  flashes  required 
for  each  choice  would  be  Increased.  The  optimal  size  of  the  matrix  remains 
a  matter  for  further  Investigation. 

The  answer  to  this  first  question  was  not  entirely  surprising.  There 
Is  by  now  an  extensive  literature  that  establishes  the  reliability  with 
which  the  P300  Is  elicited  by  rare,  task-relevant  events  within  the 
framework  of  the  oddball  paradigm.  It  Is  quite  clear  that  almost  any 
arrangement  that  would  Impose  a  categorization  on  a  series  of  events, 
however  abstract  the  categorization,  can  be  used  to  elicit  sizeable  P300s 
provided  the  two  categories  are  presented  In  a  Bernoulli  sequence,  that  the 
stimuli  play  an  Important  role  In  the  subject's  Information  processing,  and 
that  one  of  the  categories  occurs  with  a  somewhat  lower  frequency  (See 
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Fabianl  et  al.,  in  press,  for  a  discussion  of  the  varieties  of  the  oddball 
paradigm).  Note  that  even  though  Improbable  events  do  tend  to  elicit  a  P300 
with  ease,  rarity  Is  neither  a  necessary  nor  a  sufficient  condition  for  the 
elicitation  of  the  P300.  In  any  event,  our  data  do  confirm  that  the  P300 
can  be  used  as  a  communication  channel  by  taking  advantage  how  It  responds 
to  task-relevant  events  In  the  oddball  paradigm,  as  used  In  the  arrangement 
described  above. 

There  Is,  however,  a  second  question  whose  answer  was  by  no  means  self- 
evident.  The  utility  of  communication  channel  based  on  the  P300  depends,  as 
do  all  communication  channels,  on  the  slgnal-to-nolse  ratio.  It  Is  evident 
that  the  P300  on  which  this  channel  Is  based  Is  burled  In  the  "polyneural 
roar  of  the  EEG,"  to  use  Ross  Adey's  felicitous  phrase.  The  detection  and 
measurement  of  the  P300,  as  is  true  for  other  ERP  components,  requires 
signal  averaging.  Thus,  It  was  conceivable  that  while  the  P300  can,  in 
principle,  serve  as  a  switch,  its  reliability  under  the  slgnal-to-noise 
conditions  which  It  presents  would  have  been  quite  Impractical  for  actual 
use.  Our  main  purpose  In  this  study,  then,  was  to  examine  the  operating 
characteristics  of  the  communication  channel. 

The  prime  task  of  the  channel  Is  to  communicate  the  choice  the  subject 
has  made  among  the  36  options.  Thus,  the  performance  index  for  the  channel 
Is  the  accuracy  with  which  this  choice  Is  communicated,  as  a  function  of  the 
speed  with  which  the  channel  operates.  The  speed  Is  controlled  by  the  rate 
at  which  the  stimuli  are  presented.  The  accuracy  Is  controlled  by  the 
efficiency  of  the  slgnal-to-nolse  reduction  achieved  by  the  detection 
algorithms.  It  Is  for  this  reason  that  we  used  as  Independent  variables  the 
Inter-stimulus  interval  within  each  trial  and  the  various  detection 
procedures. 
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The  conclusions  are  quite  clear.  The  channel  can  operate  reasonably 
well  at  the  speed  of  12  bits  per  minute.  A  character,  chosen  from  among  36 
Items,  can  be  detected  with  95%  accuracy  within  26  seconds. 

The  inter-stimulus  Interval  proves  to  be  an  important  variable.  To 
obtain  accurate  discrimination  of  the  attended  stimulus,  a  certain 
signal-to-nolse  reduction  Is  required.  This  can  be  achieved  by  Increasing 
the  interval  between  stimuli  from  125  to  500  msec,  allowing  for  a  better 
definition  of  the  P300.  Alternately,  the  signal-to-nolse  reduction  can  be 
achieved  by  an  Increase  In  the  number  of  trials.  Which  of  these  methods  Is 
more  effective  depends  on  the  subject  and  the  signal  detection  algorithm. 

The  various  tables  and  figures  reviewed  above  show  that  different 
detection  methods  varied  In  their  effectiveness  when  applied  to  the  data  of 
the  different  subjects.  The  differences  In  effectiveness  are  due  to  an 
interaction  between  the  nature  of  the  procedures  and  the  specific  attributes 
of  the  subject's  data.  It  Is  useful  to  consider  the  differences  among  the 
detection  algorithms. 

Comparison  of  the  different  algorithms 

Covariance  computes,  essentially,  how  similar  the  Individual  ERPs  are 
to  a  template  consisting  of  the  average  waveform  for  the  attended  cell  In 
the  training  set.  All  time  points  are  Included,  and  each  point  Is  weighted 
according  to  the  mean  amplitude  of  that  point  In  the  training  set. 

SWOA  Involves  much  more  extensive  computations  on  the  training  set  data 
than  covariance,  but  It  Is  In  general  more  efficient  because  It  gives 
greater  weight  to  time  points  that  were  more  effective  In  distinguishing 
between  attended  and  unattended  cells  In  the  training  set. 

The  primary  weakness  of  both  SWDA  and  covariance  is  sensitivity  to 
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latency  variability.  If  an  ERP  component,  such  as  P300,  appears  In  a  given 
trial  with  much  longer  or  shorter  latency  than  the  modal  latency  in  the 
training  set,  then  the  discriminant  weights  (or,  similarly,  the  weights  In 
the  covariance  algorithm)  will  not  be  applied  to  the  points  that  best 
characterize  the  P300,  and  accuracy  will  be  lost.  Latency  jitter  during  the 
training  set,  also,  will  add  noise  to  the  system  and  result  in  less 
effective  weights. 

Peak  picking,  on  the  other  hand,  Is  highly  insensitive  to  latency 
variability.  The  P300  peak  can  be  located  anywhere  In  a  relatively  wide 
time  window.  All  of  the  Information  contained  in  the  other  points,  however, 
is  lost  by  this  procedure.  Moreover,  at  a  short  ISI,  Insensitivity  to 
latency  variability  becomes  a  weakness  Instead  of  a  strength.  Since  the 
peak  picking  algorithm  locates  a  maximum  value  at  any  point  within  a 
considerable  range.  It  Is  susceptible  to  falsely  attributing  a  P300  peak 
generated  by  a  previous  or  subsequent  flash  to  the  stimulus  being 
considered.  This  fact  undoubtedly  accounts  for  a  large  part  of  the 
interaction  between  algorithm  and  ISI  shown  In  Tables  I  and  II,  where  peak 
was  the  most  accurate  algorithm  at  500  msec  ISI  and  the  one  of  the  least 
accurate  at  125  msec. 

The  area  analysis  algorithm,  like  the  covariance  algorithm,  considers 
all  of  the  points  In  a  broad  range,  but  It  is  a  purely  additive,  rather  than 

multiplicative,  procedure,  and  does  not  use  a  training  set.  Therefore  It 

/ 

misses  some  Information  contained  In  a  consistent,  distinctive  ERP  shape  and 
time  course,  but  also  avoids  some  of  the  noise  Introduced  Into  SWDA  and 
covariance  by  variability  In  the  time  course  and  shape  of  ERPs.  It  takes 
advantage  of  Information  contained  In  a  broad,  flat  ERP  that  Is  lost  In  the 
peak  picking  algorithm,  but  by  the  same  token  Is  Influenced  by  noise  at 
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points  at  a  distance  from  the  peak. 

Because  of  these  differences,  different  algorithms  are  more  effective 
in  different  cases.  For  a  subject  whose  P3G0s  have  a  distinct  peak  with 
considerable  latency  variability,  peak  picking  Is  likely  to  be  the  most 
efficient  algorithm,  at  least  when  a  relatively  long  ISI  is  used.  For  a 
subject  whose  ERPs  have  any  distinctive  shape  and  little  latency 
variability,  SWDA  Is  likely  to  be  the  most  efficient.  For  a  subject  whose 
P300s  tend  to  be  broad  and  flat,  without  much  of  a  peak  and  with 
considerable  latency  variability,  area  is  likely  to  be  the  most  efficient. 

In  a  clinical  application,  data  such  as  the  data  reported  here  could  be 
collected  and  analyzed,  and  the  optimal  algorithm  and  timing  parameters  for 
the  individual  could  be  determined  and  utilized  for  real-time  signal 
detection. 

Neither  of  the  ISIs  was  clearly  superior.  In  several  cases,  shortening 
the  ISI  from  500  to  125  msec  resulted  In  a  nearly  equal  trade-off  between 
shorter  trials  and  a  greater  number  of  trials  required  to  reach  a  given 
accuracy  criterion.  The  signal -to-nolse  ratio  decreased  with  shorter  trials 
because  there  was  more  overlap  between  the  data  contained  in  successive 
subtrials  (Figure  2),  and  consequently  more  contamination  In  the  600  msec 
analysis  epoch  after  each  flash  of  a  row  or  column  with  ERP  components 
related  to  the  flashes  of  other  rows  or  columns. 

Conclusions,  applications,  and  potential  Improvements 

The  above  differences  notwithstanding,  the  general  conclusion  is 
sustained  by  the  data.  It  is  quite  possible  to  use  the  P300  as  an  effective 
communication  switch,  and  the  communication  channel  can  be  organized  so  that 
the  choices  can  be  communicated  using  a  relatively  small  number  of  trials. 
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We  can  assume  for  the  rest  of  this  discussion  that  the  characters  can  be 
conmunlcated  with  reliability  at  the  rate  of  one  character  every  26  seconds, 
or  2.3  characters  per  minute. 

This  is,  of  course,  rather  a  low  rate  for  a  communication  channel. 

Even  a  slow  typist  can  type  150  characters  per  minute.  Voice  communication 
is  even  faster.  However,  it  Is  equally  clear  that  when  no  other  channel  is 
available  because  the  skeleto-musculature  Is  completely  disabled,  the 
ability  to  comnunicate  even  at  the  rate  of  a  few  characters  per  minute  would 
be  most  welcome. 

The  utility  of  this  approach  for  cases  of  such  severe  disability 
demands  further  research.  We  have  recently  Initiated  a  series  of  studies  to 
determine  the  feasibility  of  utilizing  this  technuque  with  patients  who  are 
Incapable  of  speech  or  other  motor  system  output. 

In  addition  to  clinical  applications  In  cases  of  severe,  permanent 
disability,  there  may  be  other  cases  In  which  a  very  short  term  disability 
may  require  a  dependence  on  such  a  prosthesis.  Examples  of  this  might  be 
burn  victims  and  Individuals  In  the  early  stages  of  recovery  from  severe 
trauma. 

The  value  of  the  P300  channel  may  be  further  enhanced  If  the  procedure 
Is  used  as  a  method  for  choosing  from  a  menu  of  commands  rather  than  as  a 
method  for  spelling  words.  The  elements  in  the  matrix  may  well  be  words 
such  as  "Nurse,"  "Water,"  "Pain,"  or  "Dinner."  Each  of  these  choices  may  In 
turn  call  for  another  menu.  In  such  a  paradigm  the  rate  of  conmunicatlon 
would  be  enormously  amplified,  even  though  the  domain  of  the  communication 
would  be  constricted.  Furthermore,  the  communication  speed  we  have  assessed 
In  this  study  examined  the  channel  without  any  attempt  to  benefit  from  a 
number  of  obvious  procedures  for  accelerating  the  communication.  As  a 
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computing  device  must  be  a  part  of  the  system,  it  Is  relatively  trivial  to 
Incorporate  In  the  channel  the  known  constraints  of  the  language.  With  each 
letter  presented  the  number  of  actual  options  Is  reduced,  as  combinations  of 
characters  appear  with  quite  uneven  probabilities  In  English.  The  system 
may  be  allowed  to  "guess"  so  that,  for  example,  having  detected  a  "TH"  pair 
it  can  be  relatively  sure  the  following  character  would  be  one  of  the 
vowels. 

It  may  also  be  possible  to  enhance  the  speed  of  the  system  by 
Incorporating  additional  components  of  the  ERP.  If,  for  example,  we  were  to 
present  the  rows  and  columns  In  a  regular  sequence,  one  would  expect  to  see 
a  CNV  develop  as  the  time  for  the  appearance  of  the  correct  column,  or  row, 
neared.  The  relative  effectiveness  of  a  random  presentation  utilizing  the 
P300  solely  and  a  presentation  that  capitalized  on  both  a  CNV  and  the  P300 
Is  a  matter  for  further  research. 

The  procedures  we  describe  In  this  paper  and  the  data  we  adduce  serve 
to  illustrate  the  feasibility,  and  the  limitations,  of  the  "blocybernetlc" 
concept.  The  term  "biocybernetics"  has  been  used  to  describe  an  attempt 
sponsored  during  the  1970' s  by  DARPA  to  develop  a  "blocybernetlc"  channel. 
That  channel  was  to  enhance  the  communication  between  people  and  machines  by 
adding  channels  of  communication  that  employed  psychophyslological  means. 
Several  approaches  were  proposed  (see  Gomer  et  al.,  1979).  There  were 
several  attempts  to  use  the  ERP  as  a  switch.  Vidal  and  his  associates  have, 
for  example,  used  the  differences  between  the  responses  to  different 
checkerboards  which  flashed  on  different  parts  of  the  screen  to  create  an 
EEG-drlven  joystick  that  controlled  the  movements  of  a  displayed  "mouse" 
(Hickman  and  Vidal,  1976).  Oonchln  and  his  colleagues  developed  within  the 
framework  of  the  blocybernetlc  program,  and  In  subsequent  work,  the  use  of 
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the  P300  as  an  index  of  mental  workload  (see  Donchin  et  al.,  1986  for  a 
review).  In  the  assessment  of  workload,  however,  the  P300  is  used  as  a 
metric  rather  than  as  a  switch. 

A  caveat  may  be  in  order.  The  blocybernetlc  concept  has  often  been 
mistaken,  especially  In  the  popular  press,  as  an  attempt  to  use  the  computer 
to  "read  the  mind"  of  a  subject.  Here,  too.  It  is  possible  to  be  deceived 
by  the  appearance  of  a  subject  "writing  to  the  screen"  or  "speaking  through 
the  computer"  with  the  brain  waves.  One  may  be  tempted  to  see  this  as  a 
direct  communication  between  the  computer  and  the  mind  that  somehow 
by-passes  the  control  people  have  over  the  Inner  workings  of  their  minds. 
Such  an  innovation  would  be  treated  with  dismay  by  many,  and  with  glee  by 
some.  We  emphasl2e  that  this  paper  does  not  report  the  development  of  a 
means  by  which  one  can  eavesdrop  on  the  mind  (see  Donchin,  In  press). 

The  procedure  we  describe  above  accomplishes  no  more  than  to  provide 
the  subject  with  a  switch  that  can  be  wielded  at  the  subject's  discretion. 
The  recording  would  be  of  no  use  whatsoever  if  the  subject  chose  to  ignore 
our  instructions  and  to  focus  attention  elsewhere  In  the  environment. 
Furthermore,  the  probes  we  attach  to  the  head  record  signals  that  can  be 
interpreted  solely  within  the  framework  of  the  stimulus  arrangement  we  have 
provided.  Thus,  there  Is  no  more  "mind  reading"  In  the  procedures  we 
describe  than  there  Is  when  a  person  Is  handed  a  pencil  and  asked  to  record 
Impressions.  The  contents,  and  the  reliability,  of  the  information  obtained 
will  depend  to  a  degree  on  the  sharpness  of  the  pencil;  but  the  subject's 
willingness  to  report  and  the  accuracy  of  these  reports  will  be  the  factors 
that  ultimately  determine  the  utility  of  the  communication.  We  report  here 
that  the  P300  can  serve  as  a  pencil,  and  that  the  pencil  Is  actually  rather 
sharp.  The  mind,  however,  retains  control  over  the  use  of  the  pencil. 
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FIGURE  CAPTIONS 

Figure  1 

CRT  display  used  In  the  Mental  Prosthesis.  The  rows  and  columns  of  the 
matrix  were  flashed  alternately.  The  letters  selected  by  the  subject  ("B-R- 
A-I-N")  were  displayed  at  the  top  of  the  screen  in  the  pilot  study. 

Figure  2 

Time  course  of  events  In  the  blocks  using  the  ISI  of  125  msec.  Six  columns 
(or  6  rows)  were  Intensified  (“flash")  In  a  random  sequence  for  100  msec,  at 
125  msec  Intervals  (ISI).  EEG  was  recorded  from  20  msec  prior  to  the  first 
flash  until  600  msec  after  the  last  flash.  Each  flash  served  as  the  onset 
of  a  600  msec  analysis  epoch  ("subtrial").  A  trial  comprised  6  flashes--one 
flash  of  each  row  or  one  flash  of  each  column— plus  the  associated  data 
collection  time,  a  total  of  1245  msec. 

Figure  3 

Average  waveforms  for  attended  vs.  unattended  cells  for  each  of  the  4 
subjects.  A)  500  msec  ISI;  B)  125  msec  ISI. 

Figure  4 

Examples  Illustrating  the  comparison  of  scores  generated  by  the 
analysis  algorithms,  with  different  numbers  of  trials  Included  In  the 
analysis.  This  figure  provides  an  example  of  the  data  that  went  Into  the 
speed/ accuracy  calculations,  and  Is  provided  for  the  sake  of  Illustration. 

In  this  figure  we  plot  A)  SWDA  scores  for  subject  3  In  the  500  msec 
condition  and  B)  Covariance  scores  for  subject  s  In  the  500  msec  ISI 
condition.  Each  line  represents  the  score  obtained  for  one  letter,  as 
sample  size  was  varied  systematically  from  2  to  40  trials  per  sample. 

Scores  fall  Into  3  groups:  1)  The  "attended"  subtrials  (solid  line), 
which  begin  with  a  flash  of  the  attended  letter  (and  Its  row  or  column);  2) 
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unattended  subtrials  that  begin  with  the  flash  of  a  letter  that  shares  a  row 
or  column  with  the  attended  letter  (dotted  and  chained  11nes)--of  course, 
the  row  or  column  containing  this  unattended  letter  flashes  as  well;  and  3) 
other  unattended  subtrials  (dashed  lines).  A  correct  Identification  was 
made  whenever  the  attended  stimulus  received  the  highest  score. 

Note  that  this  figure  Includes  data  from  only  one  Iteration  of  the 
analysis  procedure,  l.e..  It  represents  only  one  random  sample  of  each  size 
and  therefore  Includes  only  one  score  for  each  letter  at  each  sample  sl2e. 
Speed/accuracy  results  (see  Figure  5)  were  obtained  by  repeating  this 
sampling  and  analysis  procedure  1000  times  for  each  sample  size  and  tallying 
the  proportion  of  Iterations  In  which  the  attended  cell  resulted  In  the 
highest  score  at  each  sample  size. 

It  can  be  seen  that,  in  these  particular  samples,  the  attended  cell  did 
generally  achieve  the  highest  score.  It  can  also  be  seen  that  there  was 
considerable  Individual  variance. 

Figure  5 

Graphs  of  the  accuracy  of  each  of  the  4  algorithms  In  identifying  the 
attended  stimulus,  as  a  function  of  the  number  of  trials  considered  In  the 
analysis.  Each  of  the  graphs  presents  the  Increase  In  the  accuracy  of  the 
algorithm  as  sample  size  Increases.  To  facilitate  the  comparison  of  the  125 
msec  and  500  msec  ISIs  and  the  analysis  of  the  speed  of  the  system,  the 
number  of  trials  has  been  transformed  to  the  time  required  to  present  the 
trials.  Time  Is  the  product  of  the  number  of  trials  considered  and  the 
duration  of  a  single  trial  (1TI).  Accuracy  Is  the  percent  of  correct 
Identifications  of  the  attended  cell  out  of  1000  Iterations  of  one  of  the  4 
algorithms. 
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TABLE  I 

Times  required  to  obtain  a)  BOX  and  b)  95X  accuracy,  by  subject  and  IS  I . 
using  the  fastest  algorithm  In  each  case.  Accuracy  Is  the  percent  of 
correct  Identifications  of  the  attended  cell  out  of  1000  Iterations  of  tne 
signal -detect Ion  algorithm.  Time  Is  the  product  of  the  number  of  trials 
considered  and  the  duration  of  a  single  trial  ( ITI ) .  (Times  have  been 
Interpolated  when  the  least  number  of  trials  required  to  reach  80X  or  95X 
accuracy  not  only  reached  but  In  fact  exceeded  that  level.)  Times  for  the 
fastest  combination  of  an  algorithm  and  an  ISI  for  each  subject  are  followed 
by  2  asterisks  (**). 
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TABLE  II 

Times  required  to  obtain  a)  80*  and  b)  95*  accuracy,  by  subject  and  ISI  for 
the  4  signal-detection  algorithms.  Accuracy  Is  the  percent  of  correct 
determinations  of  the  attended  cell  out  of  1000  Iterations  of  the  algorithm. 
Time  Is  the  product  of  the  number  of  trials  considered  and  the  duration  of  a 
single  trial  (ITI).  (Times  have  been  Interpolated  when  the  least  number  of 
trials  required  to  reach  80*  or  95*  accuracy  not  only  reached  but  In  fact 
exceeded  that  level.)  Times  for  the  fastest  algorithm  at  each  ISI  for  each 
subject  are  followed  by  an  asterisk  (*).  Times  for  the  fastest  combination 
of  an  algorithm  and  an  ISI  for  each  subject  are  followed  by  2  asterisks 
(**).  Cases  where  the  algorithm  did  not  result  In  80*  (a)  or  95*  (b) 
accuracy  after  80  trials  are  Indicated  by  "X.“ 


Mental  Prosthesis 


38 


FOOTNOTES 

1.  The  research  at  the  Cognitive  Psychology  Laboratory  described  was 
supported  In  part  by  AFOSR  contract  IF49620-79-C-0233.  The  support  of  A1 
Fregly  Is  appreciated. 
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the  onset  of  a  600  msec 
analysis  epoch  ("subtrial") 


ERPs  for  Attended  and  Unattended  Cells 
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TABLE  I 

SPEED/ACCURACY:  FASTEST  ALGORITHMS 


A)  80%  Accuracy 

B)  95%  Accuracy 

ISI  (msec)  125 

500 

125 

500 

Time  (sec)  Algo. 

Time  Algorithm 

Time  Algorithm 

Time  Algorithm 

Subj.  # 

1  15.7“  SWDA 

28 2  Peak 

21.6“  SWDA 

425  Peak 

'  2  33.4  SWDA 

233“  Peak 

57.5  SWDA 

355“  Peak 

3  22.3  SWDA 

11.1“  SWDA 

46.4  SWDA 

175”  SWDA 

4  36.7  Cov 

17.7“  Peak 

64.0  Area 

293”  Peak 

1  Mean  27.0 

20.1 

47.4 

31.2 

Mean  time  to  80%  accuracy, 

Mean  time  to  95%  accuracy, 

fastest  ISI  and  Algorithm 

fastest  ISI  and  Algorithm 

]  for  each  subject 

for  each  subject 

20.9 

26.0 

Times  required  to  obtain  a)  80%  and  b)  95%  accuracy,  by  subject  and  ISI.  using 
the  fastest  algorithm  in  each  case.  Accuracy  is  the  percent  of  correct  identifica¬ 
tions  of  the  attended  cell  out  of  1 000  iterations  of  the  signal-detection  algorithm. 
Time  is  the  product  of  the  number  of  trials  considered  and  the  duration  of  a  single 
trial  (ITI)  (Times  have  been  interpolated  when  the  least  number  of  trials  required 
to  reach  80%  or  95%  accuracy  not  only  reached  but  in  fact  exceeded  that  level.) 
Times  for  the  fastest  combination  of  an  algorithm  and  an  ISI  tor  each  subject  are 
followed  by  two  asterisks  (**). 


TABLE  H 

SPEED/ACCURACY:  4  ALGORITHMS 


A)  80%  Accuracy  B)  95%  Accuracy 


PEAK  PICKING 


Subj.  #  Time  to  80%  Accuracy 

Time  to  95%  Accuracy 

125  msec  ISI 

500  msec  ISI 

125  msec  ISI 

500  msec  ISI 

1  X 

28.2* 

X 

42.5* 

2  X 

23.3** 

X 

35.5** 

3  39.8 

17.3 

X 

26.0 

4  38.8 

17.7“ 

70.4 

'  29.3” 

SWDA 


Subj.  # 

Time  to  80%  Accuracy 

Time  to  95%  Accuracy 

125  msec  ISI 

500  msec  ISI 

125  msec  ISI 

500  msec  ISI 

1 

15.7” 

114.8 

21.6” 

202.8 

2 

33.4* 

56.9 

57.5* 

X 

3 

22.3* 

11.1” 

46.4 

17.6” 

4 

54.4 

26.7 

X 

49.5 

AREA 

Subj.# 

Time  to  80%  Accuracy 

Time  to  95%  Accuracy 

125  msec  ISI 

500  msec  ISI 

125  msec  ISI 

500  msec  ISI 

1 

29.1 

39.9 

76.7 

59.3 

2 

49.0 

56.6 

X 

X 

3 

29.3 

12.6 

55.8 

17.9 

4 

45.5 

44.9 

82.2 

52.9 

COVARIANCE 


Subj .  #  Time  to  80%  Accu  racy 

Time  to  95%  Accuracy 

125  msec  ISI 

500  msec  ISI 

125  msec  ISI 

500  msec  ISI 

1  X 

42.9 

X 

62.4 

2  X 

X 

X 

X 

3  41.8 

15.5 

82.2 

22.6 

4  36.7 

28.6 

64.0 

52.0 

Times  required  to  obtain  a)  80%  and  b)  95%  accuracy,  by  subject  and  ISI  lor 
the  tour  signal-detection  algorithms.  Accuracy  is  the  percent  of  correct 
determinations  of  the  attended  cell  out  o(  1 000  iterations  of  the  algorithm. 

Time  is  the  product  of  the  number  of  trials  considered  and  the  duration  of  a 
single  trial  (ITI).  (Times  have  been  interpolated  when  the  least  number  of 
trials  required  to  reach  80%  or  95%  accuracy  not  only  reached  but  in  fact 
exceeded  that  level.)  Times  tor  the  fastest  algorithm  at  each  ISI  for  each 
subject  are  followed  by  an  asterisk  (*) .  Times  for  the  fastest  combination  of  an 
algorithm  and  an  ISI  for  each  subject  are  followed  by  two  asterisks  (**).  Cases 
where  the  algorithm  did  not  result  in  80%  (a)  or  95%  (b)  accuracy  after  80  trials 
are  indicated  by  “X." 
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Abstract 

This  paper  presents  a  procedure,  Vector  filter,  that  decomposes  the 
event-related  brain  potential  into  components  on  the  basis  of  scalp 
distribution.  It  is  assumed  that  the  voltage  values  observed  at  several 
electrode  sites  at  any  point  in  time  is  given  by  the  linear  combination  of  a 
set  of  components  and  background  noise,  and  that  the  scalp  distribution  of 
each  component  is  invariant  and  known.  Each  component's  scalp  distribution 
is  expressed  by  a  set  of  weights,  one  for  each  electrode.  The  amplitude  of 
the  component,  at  any  point  in  time,  is  then  derived  using  a  least  squares 
criterion.  We  review  two  problems  in  the  use  of  the  procedure:  the 
selection  of  the  set  of  components  and  the  derivation  of  the  scalp 
distribution  of  each  of  the  components.  Two  application  of  the  procedure 
are  discussed:  identification  of  the  component  structure  of  an  event-related 
potential  waveform,  and  filtering  of  a  waveform  for  a  particular  component. 
Unlike  other  component  decomposition  procedures.  Vector  filter  can  be 
applied  when  the  latency  of  a  component  varies  as  a  function  of  trial, 
condition,  or  subject  population. 

,  DESCRIPTORS:  Multivariate  Analysis  of  ERPs,  Scalp  Distribution,  Vector 
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Event-related  brain  potentials  (ERPs)  are  usually  described  in  terms  of 
underlying  components.  These  components  are  assumed  to  vary  in  amplitude 
and  latency,  as  a  function  of  experimental  manipulations,  individual 
differences,  and  other  variables.  Furthermore,  several  components  can  be 
active  at  the  same  moment  in  time,  so  that  the  voltage  observed  at  any  time 
point  and  at  any  electrode  may  be  the  result  of  the  activity  of  several 
overlapping  components.  This  of  course  complicates  the  analysis  of  ERPs, 
since  it  is  often  difficult  to  decide  to  what  extent  the  voltage  observed  at 
a  particular  point  in  time  at  a  particular  electrode  can  be  attributed  to 
one  rather  than  another  component. 

Several  procedures  have  been  used  to  disentangle  overlapping 
components.  Most  of  these  procedures,  such  as  Principal  Component  Analysis 
(for  reviews  see  Coles,  Gratton,  Kramer,  &  Miller,  1986;  Donchin  &  Heffley, 
1978)  or  waveform  subtraction,  assume  that  the  latency  of  the  components  is 
invariant  across  different  conditions.  Principal  Component  Analysis  assumes 
that  the  timecourse  of  each  component  is  constant  across  a  set  of  data.  The 
subtraction  procedure  assumes  that  the  only  difference  between  two  waveforms 
is  the  presence  of  one,  target  component.  Thus,  both  the  timecourse  and  the 
amplitude  of  the  other,  "background"  components,  that  are  common  to  both 
waveforms,  are  assumed  to  be  fixed.  However,  it  is  evident  that  there  are 
circumstances  in  which  the  assumption  of  the  invariance  of  component  latency 
is  inappropriate.  This  is  especially  the  case  with  respect  to  late  ERP 
components  (such  as  P300,  N400,  slow  wave,  etc.). 

The  Vector  filter  procedure, 1  to  be  described  in  this  report,  relies  on 
the  distribution  of  the  voltages  across  the  scalp  (the  "scalp  distribution") 
as  the  anchor  point  for  the  decomposition  of  an  ERP  into  its  components.  As 
we  shall  see,  the  Vector  filter  procedure  provides  for  variability  in  the 
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latency  of  a  component. 

It  is  widely  believed  that  scalp  distribution  provides  at  least  one 
method  to  distinguish  among  components  (e.g.,  Donchin,  1978;  Donchin, 

Ritter,  &  McCallum,  1978;  Kutas  &  Hillyard,  1982;  Naatanen  &  Picton,  1987; 
Picton,  Woods,  Stuss,  &  Campbell,  1978).  A  corollary  of  this  belief  is  that 
components  are  characterized  by  particular  scalp  distributions. 2  This  view 
is  consistent  with  the  assumption  that  each  ERP  component  represents  the 
activation  of  a  particular  neuronal  ensemble  (or  set  of  ensembles).  While 
the  determination  of  these  ensembles  from  scalp  distribution  information  is 
an  extremely  difficult  task  (and  one  which  we  are  not  attempting  here),  "it 
is  somewhat  simpler  just  to  differentiate  components  of  the  evoked  potential 
on  the  basis  of  their  scalp  distribution.  Scalp-recorded  events  with 
different  voltage  distributions  must  derive  from  different  sources  ..." 
(Picton  et  al.,  1978,  p.  525). 

Vector  filter  provides  a  way  of  exploiting  this  view  in  order  to 
distinguish  among  the  contribution  of  several  components  to  an  obtained  ERP. 
As  its  name  implies,  "Vector  filter"  is  a  signal  enhancing  procedure.  The 
procedure  filters  for  a  particular  scalp  distribution,  which  is  believed  to 
characterize  the  signal  of  interest  (the  component).  In  general,  most 
techniques  for  analyzing  components  can  be  conceptualized  as  linear  filters, 
that  weight  temporal,  frequency,  or  spatial  information  in  order  to  enhance 
the  components  of  interest.  For  frequency  filters,  different  weights  are 
assigned  to  different  frequencies.  Thus,  a  low-pass  filter  of  10  Hz, 
assigns  a  high  weight  to  frequencies  below  10  Hz,  and  a  low  weight  to  higher 
frequencies.  (These  weights  are  generally  selected  a  priori . )  For  temporal 
filters,  different  weights  are  assigned  to  different  time  points  in  the  ERP 
waveform.  For  example,  for  the  P300  component,  the  weights  assigned  to 
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voltage  values  for  time  points  between,  say,  300  and  500  ms,  are  higher  than 
those  for  other  time  points.  (Note  that  the  weights  can  be  determined  in 
advance  or  by  using  statistical  techniques,  such  as  Principal  Component 
Analysis.)  In  the  case  of  spatial  filters,  different  weights  are  assigned 
to  voltages  recorded  from  different  electrode  locations.  For  example,  when 
we  select  one  particular  electrode  location,  we  assign  a  weight  of  1  to  that 
electrode,  and  a  weight  of  0  to  all  the  other  electrodes.  Vector  filter  is 
a  type  of  spatial  filter  that  allows  one  to  select  any  set  of  weights  for 
different  electrodes. 


Method 


The  model 

The  basic  model  used  by  Vector  filter  assumes  that  the  voltage  values 
observed  at  a  particular  moment  in  time  at  different  electrode  locations  are 
given  by  the  linear  combination  of  a  series  of  components  and  noise.  It  is 
assumed  that,  across  timepoints  and  repetitions,  the  scalp  distribution  of 
each  component  is  constant,  while  its  amplitude  is  variable.  The  scalp 
distribution  of  a  component  can  be  described  by  a  set  of  weights,  one  for 
each  scalp  electrode.  These  ideas  can  be  formalized  by  the  following 
expression  (I): 

(I)  Volt(ij)  =  Sum(k)  c( ik)W(jk)  +  Err(ij) 

Where  Volt(ij)  is  the  voltage  recorded  at  the  electrode  (j)  at 
tlmepoint  (i),  c(ik)  is  the  amplitude  of  component  (k)  at  timepolnt  (1), 
W(jk)  is  the  weight  of  component  (k)  at  electrode  (j),  and  Err(lj)  is  the 
noise  at  the  electrode  (j)  at  timepoint  (i).  If  we  define  spatial  filters 


Vector  filter 


Page  7 


in  terms  of  particular  sets  of  weights,  then  the  filters  are  identified  with 
the  scalp  distribution  associated  with  each  component. 

The  set  of  weights  for  each  component  are  assumed  to  be  known.  (In 
later  sections  of  this  paper  we  will  review  procedures  for  their 
estimation.)  Note  that  the  assumption  of  linearity  implies  that  the  scalp 
distribution  (ratio  among  values  at  different  electrodes)  remains  constant 
as  the  amplitude  of  the  component  changes.  Support  for  this  view  is 
provided  by  McCarthy  and  Wood  (1985).  These  authors  argue  that  when  there 
is  a  change  in  the  activity  of  a  source  underlying  a  component,  there  should 
be  a  proportional  change  in  the  scalp  recorded  activity  at  each  electrode. 
Derivation  of  f i ltered  waveforms 

The  procedure  comprises  three  critical  elements:  (a)  an  obtained  ERP 
waveform  for  each  of  several  electrodes  (the  data  set)  for  which  we  want  to 
know  the  relative  contribution  (amplitude)  of  the  constituent  components  at 
each  time  point;  (b)  a  set  of  spatial  filters,  each  defining  a  component, 
which  take  the  form  of  vectors  of  weights,  one  per  electrode,  that  specify 
the  extent  to  which  a  component  is  active  at  the  electrode  --  these  weights 
thus  describe  the  scalp  distribution  of  each  component;  and  (c)  an  output 
waveform  (amplitude  by  time  function)  for  each  component  that  describes  its 
amplitude  at  each  time  point  in  the  obtained  ERP  waveforms.  Note  that  the 
procedure  results  in  the  transformation  of  the  input  data  set  (one  ERP 
waveform  for  each  electrode)  into  an  output  set  of  waveforms  (one  for  each 
component) . 

This  tranformation  is  accomplished  in  the  following  way.  First,  for 
each  timepoint,  we  determine  the  combination  of  the  components  which 
provides  the  optimal  approximation  to  the  observed  data.  This  is 
accomplished  by  estimating  a  particular  "beta  coefficient"  (labelled  c ( 1  k ) 
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in  equation  (1))  for  each  component.  The  beta  coefficients  are  taken  as 
estimates  of  the  amplitude  of  the  components  at  that  particular  point  in 
time.  The  procedure  is  then  repeated  for  each  time  point  in  the  epoch,  so 
that,  for  each  component,  we  get  amplitude  by  time  functions,  depicting 
their  time  courses. 

The  crucial  step  in  the  procedure,  therefore,  is  the  estimation  of  the 
beta  coefficients  to  assign  to  each  component  to  produce  the  optimal 
approximation  to  the  observed  data.  This  is  accomplished  by  applying  a 
least  squares  procedure  (see  Table  1)  in  which  we  determine  the  coefficients 
that  should  be  given  to  each  component  to  minimize  the  error  in  the 
description  of  voltage  values  observed  at  all  the  electrodes  for  that 
timepoint.  Note  that  the  components  are  used  as  predictors,  and  the 
observed  data  (values  at  the  different  electrodes)  as  the  criterion. 


Insert  Table  1  about  here 


Table  1  shows  how  the  observed  values  at  the  different  electrode 
locations  (1  to  N)  at  a  particular  time  point  can  be  considered  as  a 
"vector"  of  criterion  values  (Volt  (1)  to  Volt  (N)  in  Table  1),  that  are 
optimally  approximated  by  the  linear  combination  of  a  set  of  components  (A 
to  Z),  each  of  which  is  characterized  by  a  “vector"  of  values  (e.g.,  A(l)  to 
A(N) ) ,  one  for  each  electrode  location.  Both  vectors  can  be  visually 
represented  as  "profiles"  of  values  at  different  locations.  Thus,  we  can 
have  a  profile  describing  the  data  observed  at  a  particular  time  point,  and 
a  profile  describing  each  of  the  components.  The  "free"  parameters  in  the 
system  of  equations  presented  in  Table  1  are  the  beta  coefficients  of  the 
components  at  that  particular  time  point  (c(A)  to  c ( Z ) ) .  Note  that  these 


Vector  f i  1  ter 


Page  9 


values  are  fixed  across  all  the  electrode  locations.  We  take  the  beta 
coefficients  as  representing  the  amplitude  of  each  component  for  that  time 
point.  The  estimation  of  the  beta  coefficients  (amplitudes  of  the 
components)  is  repeated  for  each  time  point  in  the  ERP  waveform.  When  these 
coefficients  are  plotted  over  time,  the  result  is  a  set  of  amplitude  by  time 
functions,  depicting  the  time  course  of  each  component.  These  amplitude  by 
time  functions  can  also  be  considered  as  the  outputs  of  a  set  of  spatial 
filters,  with  each  filter  "tuned*'  to  the  scalp  distribution  of  a  particular 
component . 

Charactenst  ics  of  the  procedure 

vector  filter  is  based  on  a  multiple  regression  (for  each  time  point) 
in  which  the  set  of  predictors  is  given  by  the  components,  the  criterion  is 
given  py  the  voltage  values  observed  at  the  different  electrode  locations, 
and  the  regression  coefficients  are  used  as  estimates  of  the  amplitude  of 
each  component  at  each  data  point.  Thus,  several  properties  of  multiple 
regress  ion  app  l> . 

a.  Llil  of  the  mode  I .  It  is  possible  to  assess  the  degree  to  which  the 
particular  set  of  components  that  are  chosen  actually  account  for  the 
variance  present  in  the  data.  This  is  accomplished  by  computing  the 
multiple  correlation  coefficient.  This  statistic  can  be  used  to  determine 
the  value  of  the  model  in  accounting  for  the  observed  data. 

b.  Number  of  components .  Two  considerations  govern  the  choice  of  the 
number  of  components  to  be  used,  first,  a  large  proportion  of  the  variance 
in  scalp  distribution  should  be  explained.  Second,  the  amplitude  estimates 
for  each  component  should  be  reliable.  These  two  criteria  leads  to  opposing 
strategies.  The  more  components  that  are  used,  the  more  variance  will  be 
explained,  but  the  lower  will  be  the  reliability  of  the  amplitude  estimates. 


Vector  filter 


Page  10 


To  resolve  this  dilemma,  we  note  that  the  reliability  of  the  amplitude 
estimates  will  depend  on  the  degree  to  which  a  particular  set  of  components 
accounts  for  more  variance  than  is  accounted  for  by  an  equal  number  of 
randomly  selected  components  (see  McNemar,  1969,  and  other  specialized  texts 
for  tests  of  significance  of  this  difference).  Note  that  the  proportion  of 
variance  explained  by  a  set  of  components  chosen  at  random  is  equal,  as  an 
average,  to  (n-l)/(N-l) ,  where  n  is  the  number  of  components,  and  N  is  the 
number  of  electrodes  (see  McNemar,  1969,  p.  203). 

As  an  example  of  the  application  of  these  concepts,  consider  the  case 
in  which  we  want  to  account  for  80%  of  variance  in  scalp  distribution.  What 
is  the  maximum  number  of  components  that  can  be  included  in  the  set  given 
that  we  need  to  obtain  reliable  coefficients  with  N  electrodes?  Based  on 
procedures  described  in  McNemar  (1969),  this  number  can  be  computed  by, 
first,  establishing  the  critical  F  (for  a  multiple  R  of  .9),  and,  second, 
determining  the  ratio  between  the  number  of  Components  and  the  number  of 
electrodes  to  achieve  this  critical  F.  It  turns  out  that  this  ratio  is  such 
that  the  number  of  components  should  be  smaller  than  (N+l)/2.  Thus,  as  a 
general  rule  of  thumb,  it  can  be  expected  that  low  reliability  will  obtain 
whenever  the  number  of  components  is  more  than  half  the  number  of 
electrodes. 

c.  Interrelationship  between  components.  Since  the  components  are 
defined  in  terms  of  scalp  distribution,  it  is  clear  that  two  components  with 
the  same  scalp  distribution  cannot  be  entered  simultaneously  in  the 
procedure.  Even  when  two  components  have  very  similar  (although  not 
equivalent)  scalp  distributions,  the  amplitude  estimates  for  each  of  the  two 
components  will  not  be  reliable.  In  fact,  small  fluctuations  in  the 
observed  scalp  distribution  may  result  in  large  variations  in  the  amplitude 
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estimates  of  the  two  components.  (This  problem  is  also  known  as  the 
"problem  of  mu lticol linearity"  in  multiple  regression  analysis.)  Therefore 
it  is  not  advisable  to  use  components  with  very  similar  scalp  distributions. 
The  similarity  in  scalp  distribution  between  two  components  can  be  measured 
by  using  (2): 

(2)  R( 12)  =  Sum(j)  A(j)*B(j)  /  Sqrt( (Sum(j)  A( j)**2)*(Sum( j)  B(J)**2))) 

where  R ( 12 )  is  a  measure  of  the  similarity  between  the  scalp  distributions 
of  any  pair  of  components  (A  and  B)  for  which  the  weight  at  electrode  (j) 
are  A(j)  and  B(j),  respectively.  In  no  case  should  two  components  with  an 
R( 12 )  higher  than  0.8  be  entered  simultaneously  (see  Lewis-Beck,  1980,  p. 

60;  this  and  other  specialized  texts  also  describe  more  sophisticated 
methods  of  detecting  and  dealing  with  multicol 1 inearity) . 

Although  the  problem  of  component  similarity  may  appear  to  limit  the 
applicability  of  the  procedure,  it  can  be  dealt  with  in  a  number  of  ways, 
each  of  which  increases  the  discriminability  between  the  scalp  distributions 
of  components.  One  approach  is  to  increase  the  number  of  electrodes  and  to 
choose  electrode  locations  that  maximize  the  discrimination  between 
components.  Another  approach  is  to  adopt  a  Laplacian  reference  system 
(Hjorth,  1979).  This  procedure,  which  is  technically  equivalent  to 
computing  the  current  source  flow,  tends  to  produce  localized  components. 
Localized  components  will  usually  have  quite  distinguishable  scalp 
distributions,  and  very  low  R( 12) * s . 

d.  Vector  filter  and  other  linear  transformations.  As  we  have  seen, 
the  Vector  filter  procedure  is  based  on  a  linear  model.  One  consequence  of 
this  fact  is  that  it  can  be  performed  before  or  after  other  linear 
manipulations  of  the  ERP  data.  For  instance,  applying  Vector  filter  to 
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single  trial  data,  and  then  averaging  the  vector  output  waveforms,  will 
produce  results  that  are  equal  to  those  obtained  by  averaging  the  single 
trial  input  data  and  then  performing  the  Vector  filter  operation  on  the 
average  waveforms.  This  property  makes  Vector  filter  suited  for  the 
analysis  of  single  trials. 


Applying  Vector  filter 

In  this  section  we  consider  two  critical  issues  in  the  application  of 
Vector  filter.  First,  we  discuss  the  problem  of  selecting  the  appropriate 
set  of  components,  and  we  examine  the  consequences  of  failure  to  select  the 
appropriate  set.  Second,  we  review  approaches  to  the  problem  of  specifying 
the  scalp  distribution  of  each  component  in  the  set,  and  present  a  practical 
example. 

Selecting  the  set  of  components 

The  Vector  filter  procedure  is  based  on  the  assumption  that  a  given  set 
of  components,  with  known  scalp  distribution,  is  responsible  for  the 
observed  ERP.  How  is  the  set  of  components  selected? 

The  answer  to  this  question  depends  in  part  on  the  aims  that  underlie 
the  application  of  Vector  filter.  If  the  purpose  is  to  isolate  the  part  of 
the  ERP  waveform  that  contains  the  components  of  interest,  then  it  is 
sufficient  to  merely  use  these  components  as  the  set.  In  this  case,  the 
application  of  Vector  filter  corresponds  to  the  application  of  other  kinds 
of  s ignal -enhancing  procedures.  For  instance,  frequency  filters  are  used  to 
isolate  activity  in  the  ERP  waveform  that  has  a  particular  frequency  range. 
Note  that.  In  the  case  of  frequency  filters.  It  is  assumed  that  the 
component  of  interest  has  a  specifiable  frequency  range.  Similarly,  in  the 
case  of  Vector  filter,  we  assume  that  the  component  of  interest  has  a 
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specifiable  scalp  distribution.  Note  that  we  do  not  need  to  know  the  scalp 
distribution  (or  frequency  characteristics)  of  aJ2  the  components  present  in 
the  ERP  waveform. 

On  the  other  hand,  when  the  aim  of  the  procedure  is  to  decompose  a 
particular  ERP  waveform  into  its  component  structure,  then  more 
sophisticated  selection  procedures  are  required.  These  procedures  can  be 
based  on  a  "step-wise"  approach,  in  which  the  goodness  of  fit  provided  by  a 
particular  set  of  components  can  be  estimated,  following  addition  or 
subtraction  of  particular  components.  In  this  case,  a  set  of  theoretically 
interesting  components  can  be  selected  first,  and  the  fit  of  the  model  based 
on  this  set  to  the  ERP  can  be  assessed  for  each  data  point,  or  across  a 
number  of  data  points.  Then,  depending  on  the  results  of  the 
goodness-of-fit  test,  new  components  may  be  added  or  old  components  may  be 
deleted.  A  criterion  for  halting  this  step-wise  operation  may  be  the 
determination  of  the  simplest  model  ( 1  .e . ,  based  on  the  least  number  of 
components)  that  produce  a  satisfactory  fit  to  the  data  (with  a  probability 
of  rejection  of  less  than  .05). 

These  procedures  for  selecting  the  set  of  components  are  predicated  on 
a  priori  knowledge  of  the  possible  components  present  in  the  ERP  waveform. 

In  most  ERP  experiments,  the  investigator  approaches  the  data  with  some 
hypotheses  about  the  components  that  should  be  elicited  in  the  particular 
experimental  paradigm.  Thus,  it  is  not  unreasonable  to  expect  that  the  set 
of  components  can  be  specified  in  advance  of  the  application  of  Vector 
filter.  However,  it  is  always  possible  that  the  investigator  is  mistaken 
and  fails  to  include  components  that  are  really  present.  What  are  the 
consequences  of  this  error? 

In  the  rest  of  this  section  we  review  these  consequences  by  applying 
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Vector  filter  to  simulated  data,  for  which  we  know  the  set  of  components 
that  are  present.  The  simulated  waveforms  were  obtained  by  adding 
components  with  fixed  scalp  distributions  to  each  other.  The  distributions 
are  described  by  a  "profile"  of  weights  (one  weight  for  each  electrode  -  see 
upper  row  of  Figure  1).  The  amplitude  of  each  input  component  varied  over 
time  (see  second  row  of  Figure  1).  To  obtain  the  ERP  waveform  for  each 
electrode  (Fz,  Cz,  and  Pz) ,  the  amplitude  of  each  component  for  each  time 
point  is  multiplied  by  the  weight  of  that  component  at  each  electrode,  and 
the  amplitudes  for  different  components  are  then  summed  (see  third  row  of 
Figure  1).3 


Insert  Figure  1  About  Here 


Applying  the  Vector  filter  operation  to  the  ERP  waveform  (third  row) 
yields  the  waveforms  shown  in  the  lower  row.  These  waveforms  represent  the 
time  course  of  the  two  components  retrieved  by  the  procedure  when  (a)  all 
the  components  present  in  the  data  are  considered,  and  (b)  the  weights  for 
each  component  are  the  same  as  those  of  the  simulated  components  (shown  in 
the  upper  panel).  Note  that,  not  surprisingly.  Vector  filter  perfectly 
retrieves  the  amplitude  and  time  course  of  each  original  component. 

In  the  next  two  simulations  we  examine  cases  in  which  the  set  of 
components  used  for  Vector  filter  is  a  subset  of  the  input  set  --  that  is, 
we  have  failed  to  include  some  of  the  components  present  in  the  data.  In 
most  cases  this  will  result  in  some  error  in  the  retrieval  of  the  original 
components.  An  example  Is  given  by  the  second  simulation,  illustrated  in 
Figure  2. 
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Insert  Figure  2  About  Here 


In  this  simulation,  we  used  the  same  simulated  ERP  waveform  used  in 
simulation  1.  However,  only  Component  A  was  entered  in  the  Vector  filter 
procedure,  while  Component  B  was  ignored.  The  error  is  illustrated  in  the 
lower  panel,  where  the  dotted  line  indicates  the  "input"  Component  A,  while 
the  solid  line  depicts  the  Component  A  as  retrieved  by  the  Vector  filter 
procedure. 

Simulation  3  illustrates  a  case  in  which  ignoring  one  component  does 
not  result  in  a  retrieval  error  for  the  other  components.  In  this 
simulation,  we  have  added  a  third  component  (Component  C)  to  the  set  of 
input  components.  This  simulation  is  illustrated  in  Figure  3. 


Insert  Figure  3  About  Here 


Again,  the  first  and  second  rows  depict  the  scalp  distribution  and  time 
course  of  the  input  components,  the  third  row,  the  composite  ERP  waveform, 
and  the  bottom  row,  the  outputs  of  the  Vector  filter  procedure,  when  only 
two  components  (A  and  B)  were  included  in  the  set.  In  this  case,  failure  to 
include  a  component  (C)  that  was  present  in  the  original  ERP  waveform,  did 
not  lead  to  any  error  in  the  retrieval  of  components  A  and  B. 

In  general,  the  error  introduced  by  ignoring  one  or  more  of  the 
components  depends  on  the  amplitude  of  the  ignored  components  and  on  the 
degree  of  similarity  between  the  component  that  Is  Ignored  and  those  that 
are  entered  in  the  analysis.  In  simulation  2,  the  component  Ignored  (B) 
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has  a  scalp  distribution  that  is  quite  similar  to  that  of  the  component 
entered  in  the  analysis  (A),  R ( 1 2 ) =0 . 38 .  In  simulation  3,  the  component 
ignored  (C)  has  a  scalp  distribution  that  is  very  dissimilar  to  that  of  the 
components  entered  in  the  analysis  (A  and  B):  both  the  R(12)'s  for  A  and  C 
and  for  B  and  C  were  equal  to  0. 

Taken  together,  simulations  2  and  3  reveal  that  inaccurate 
specification  of  the  set  of  components  will  have  minimal  consequences  under 
some  conditions.  Specifically,  when  the  omitted  components  are  either  very 
small  or  have  scalp  distributions  that  are  very  dissimilar  from  those  of  the 
components  that  are  included  in  the  set. 

In  practical  cases,  the  estimation  of  the  dissimilarity  between 
included  and  omitted  components  requires  the  specification  of  the  scalp 
distribution  of  the  omitted  component.  If  one  can  specify  the  distribution 
of  a  component,  why  not  include  it  in  the  set?  As  we  noted  above,  inclusion 
of  a  large  number  of  components  in  the  set  may  lead  to  unreliability  in  the 
estimation  of  component  amplitudes.  This  will  occur  when  the  number  of 
components  in  the  set  approaches  the  number  of  electrodes.  If  one  cannot 
specify  the  distribution  of  an  omitted  component,  how  can  one  estimate  its 
similarity  to  the  Included  components?  Clearly,  estimation  of  similarity  in 
this  case  is  impossible.  However,  one  can  take  steps  to  reduce  the 
likelihood  of  similarity  by  (a)  choosing  electrode  locations  that  maximally 
specify  components  of  interest,  and/or  (b)  using  the  Laplacian  derivation  or 
similar  procedures  to  localize  the  scalp  distribution  of  components. 
Estimating  component  distribution 

As  we  have  noted,  a  fundamental  requirement  for  the  use  of  Vector 
filter  is  specification  of  the  distribution  of  the  components  to  be  Included 


in  the  set. 
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There  are  several  ways  in  which  the  scalp  distribution  of  a  particular 
component  can  be  estimated.  Although  each  of  these  ways,  taken  alone,  may 
not  always  be  satisfactory,  the  convergence  of  several  methods  can  be  taken 
as  strong  evidence  in  favor  of  the  hypothesis  that  a  particular  scalp 
distribution  can  represent  the  component.  The  methods  to  estimate  the  scalp 
distribution  of  a  component  can  be  divided  into  three  categories: 
biophysical  methods,  statistical  methods,  and  experimental  methods. 

a.  Biophysical  methods.  These  methods  rely  on  the  fact  that  knowledge 
of  the  source  of  a  component  can  be  used  to  predict  its  scalp  distribution. 
For  example,  knowledge  of  the  functional  anatomy  of  the  motor  cortex  enabled 
Brunia  (1980)  to  account  for  the  different  distributions  of  potentials  that 
precedes  foot  and  finger  movements.  At  present,  knowledge  of  the  sources  of 
most  ERP  components  is  limited.  However,  as  such  knowledge  becomes 
available,  so  biophysical  methods  may  be  used  to  provide  descriptions  of  the 
scalp  distribution  of  a  component. 

b.  Statistical  methods.  A  second  class  of  methods  includes  those  that 
are  based  on  some  statistical  or  mathematical  properties  of  the  data.  For 
instance,  by  analyzing  the  pattern  of  variance  and  covariance  among 
electrodes  at  a  particular  time  point,  Lehmann  and  Skrandies  (1980)  and 
Wood,  McCarthy,  and  Oarcey  (1986)  were  able  to  derive  a  set  of  components  to 
describe  the  scalp  distribution  at  that  time  point.  Each  of  the  components 
could  be  described  in  terms  of  its  own  scalp  distribution.  To  the  extent 
that  this  scalp  distribution  can  be  reliably  observed,  it  can  be  used  to 
define  the  scalp  distribution  of  that  particular  component. 

Another  statistical  procedure  to  determine  an  optimal  Vector  filter  is 
to  select  that  set  of  weights  that  best  discriminates  between  two  data  sets, 
one  of  which  is  believed  to  contain  the  component  of  interest.  For  example. 
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discriminant  analysis  can  be  used  to  derive  the  set  of  scalp  distribution 
weights  that  best  discriminate  between  the  single  trial  ERPs  to  expected  and 
unexpected  events.  If  one  believes  that  these  two  classes  of  events  elicit 
P300s  of  different  magnitudes  (and  that  this  is  the  only  difference  between 
the  two  classes),  then  it  is  possible  to  use  the  set  of  weights  to 
characterize  the  P300  component.  This  may  be  particularly  useful  when  few 
electrodes  are  used  such  that  only  one  filter  can  be  applied. 

Information  about  the  constancy  of  a  particular  scalp  distribution  over 
time  can  also  be  used  to  arrive  at  a  definition  of  components.  This  method 
is  based  on  the  assumption  that,  if  a  number  of  consecutive  time  points 
share  a  similar  scalp  distribution,  they  can  be  attributed  to  the  same 
underlying  component.  Thus,  Lehmann  and  Skrandies  (1986)  divide  the 
observed  ERP  waveform  into  segments,  such  that  the  time  points  within  each 
segment  are  associated  with  a  "constant"  scalp  distribution.  Each  segment 
is  then  defined  as  a  component.  Constancy  occurs  when,  for  two  timepoints, 
the  maximum  and  minimum  voltages  occur  at  the  same  (or  adjacent)  electrodes. 
Constancy  could  also  be  defined  in  terms  of  the  magnitude  of  the  R(12)'s 
(correlation)  between  two  distributions,  as  described  above. 

Other  techniques  are  available  that  rely  on  more  restrictive  models  of 
ERPs,  and  which  may  allow  one  to  derive  analytically  the  scalp  distribution 
of  the  ERP  components.  For  instance,  Mocks  (1987)  recently  proposed  a  model 
that  assumes  that  the  ERP  is  the  result  of  the  linear  combination  of  a 
limited  set  of  components  whose  time  course  and  scalp  distribution  are 
fixed,  although  not  known.  When  these  assumptions  are  met,  it  is  possible 
to  estimate  the  time  course  and  scalp  distribution  of  each  component  from 
the  data.  Although  this  procedure  may  be  promising  in  some  cases,  it  is  not 
applicable  when  the  time  course  of  a  component  varies. 
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Scherg  and  von  Cramon  (1985)  proposed  a  mixed  biophysical/analytical 
procedure  to  disentangle  the  component  structure  of  the  ERP.  Their 
procedure  uses  both  temporal  and  spatial  information.  First,  each  component 
is  assigned  a  specific  waveshape,  although  some  flexibility  in  latency  is 
allowed.  Initial  hypotheses  are  made  about  the  nature,  locus  and 
orientation  of  the  source  of  each  component,  and  their  consequent  scalp 
distribution.  The  ERP  predicted  by  this  model  is  then  computed,  and  the 
discrepancy  between  the  predicted  and  observed  ERP  is  measured.  The 
parameters  (latency,  source  localization  and  source  orientation  of  each 
component)  are  then  adjusted  to  reduce  the  discrepancy.  The  procedure  is 
repeated  until  a  convergence  toward  a  particular  solution  is  reached.  This 
approach  appears  to  provide  a  promising  way  of  integrating  biophysical  and 
statistical  methods.  However,  like  other  procedures,  it  is  based  on  a 
number  of  assumptions,  including  the  number,  time  course,  and  nature  of  the 
components. 

c.  Experimental  methods.  These  methods  are  predicated  on  the  belief 
that  there  is  some  experimental  setting  that  can  be  used  to  provide  a  "pure" 
instance  of  a  particular  component,  for  example,  the  ERP  elicited  by  the 
omission  of  an  expected  stimulus  is  thought  to  represent  a  P300  that  is 
uncontaminated  by  the  superimposition  of  exogenous  components  (Simson, 
Vaughan,  &  Ritter,  1976).  Alternatively,  P300  can  be  defined  in  terms  of 
that  aspect  of  an  ERP  waveform  that  is  probability  dependent  (Squires  & 
Donchin,  1976).  In  the  case  of  Processing  Negativity,  the  component  is 
identified  by  subtraction  of  two  waveforms,  those  for  attended  and 
unattended  stimuli  (Naatanen,  1982).  The  difficulty  with  these  procedures 
is  that  it  must  be  asssumed  that  the  derived  waveforms  contain  one  and  only 
one  component.  If  this  condition  is  satisfied,  then  the  scalp  distribution 


Vector  filter 


Page  20 


of  the  pure  component  can  be  used. 

d.  Evaluation  of  the  methods  for  estimating  scalp  distribution.  In 
summary,  several  procedures  can  be  used  to  identify  the  scalp  distribution 
of  an  ERP  component.  For  theoretical  reasons,  biophysical  methods  are 
clearly  the  most  appropriate.  However,  since  knowledge  of  the  sources  of 
ERP  components  is  limited  at  the  present  time,  one  must  resort  to  other 
techniques.  Although  none  of  these  appears  to  be  applicable  in  all  cases, 
the  convergence  of  several  techniques  towards  a  similar  solution  may  be 
considered  as  a  strong  evidence  for  that  solution.  Such  convergence  is 
illustrated  in  the  next  section. 

e.  A  practical  example:  The  scalp  distribution  of  P300.  In  this 
section  we  provide  an  example  of  how  different  procedures  used  to  estimate 
the  scalp  distribution  of  P300  converge  on  a  common  solution.  The  data  from 
this  example  come  from  a  study  by  Miller  et  al .  (1987).  During  this  study, 
10  subjects  were  run  in  four  tasks.  In  the  first  task  (AUDITORY  COUNT 
50/50),  the  subjects  had  to  count  one  of  two  tones,  where  both  tones  had  a 
probability  of  .50.  In  the  second  task  (AUDITORY  RT  20/80),  the  subjects 
had  to  give  a  speeded  response  to  one  of  two  tones,  with  probabilities  of 
.80  and  .20  respectively.  In  the  third  task  (AUDITORY  OMITTED  STIMULUS 
10/90),  the  subjects  were  presented  with  a  series  of  tones;  on  10%  of  the 
trials  the  tone  was  omitted,  and  the  subjects  had  to  count  the  number  of 
omissions.  In  the  fourth  task  (VISUAL  NAMES  20/80),  the  subjects  were 
presented  with  male  names  (p=.20)  and  female  names  (p=.80)  and  had  to  count 
the  number  of  male  names.  ERPs  were  recorded  from  Fz,  Cz,  and  Pz, 
referenced  to  linked  mastolds.  The  grand-average  waveforms  for  each 
electrode,  task,  and  stimulus  type  are  presented  in  Figure  4. 
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Insert  Figure  4  About  Here 


A  "P300"  is  clearly  noticeable  for  all  the  rare  or  target  stimuli.  For 
the  purposes  of  this  demonstration  we  derived  four  definitions  of  P300,  one 
for  each  task.  Thus  P300  was  defined  as: 

(a)  the  difference  (at  each  electrode)  between  the  amplitude  at  the 
moment  of  the  maximum  positive  peak  after  300  ms  for  the  "target"  and 
"non-target"  trials  in  the  AUDITORY  COUNT  50/50  task  ("target 
effect/auditory” ) ; 

(b)  the  difference  (for  each  electrode)  between  the  amplitude  at  the 
moment  of  the  most  positive  peak  after  300  ms  for  the  "rare"  and  "frequent" 
trials  in  the  AUDITORY  RT  20/80  task  ("probability  effect/auditory"); 

(c)  the  amplitude  (at  each  electrode)  at  the  moment  of  the  most 
positive  peak  after  300  ms  for  the  "omitted"  stimulus  trials  in  the  AUDITORY 
OMITTED  STIMULUS  10/90  task  ("omitted  stimulus/auditory"); 

(d)  the  difference  (for  each  electrode)  between  the  amplitude  at  the 
moment  of  the  most  positive  peak  after  300  ms  for  the  "rare"  and  "frequent" 
trials  in  the  VISUAL  NAME  20/80  task  ("target  and  probability 
effects/visual") . 

This  set  of  independent  procedures  yields  different  definitions  of  P300 
scalp  distribution.  They  cover  different  modalities,  latencies, 
probabilities,  task  requirements,  and  included  different  sets  of  possible 
overlapping  components.  Note  that  all  these  procedures  belong  to  the  class 
of  "experimental  methods"  as  described  above. 

The  mean  P300  amplitudes  at  each  electrode  for  each  of  the  four 
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definitions  are  presented  in  Table  2.  In  this  table,  the  average  of  the 
four  definitions  of  P300  for  each  electrode  is  also  shown. 


Insert  Table  2  about  here 


To  determine  whether  these  four  definitions  of  P300  result  in  a 
consistent  scalp  distribution,  we  computed  the  R( 12)  between  each  pair  of 
definitions,  and  between  each  definition  and  the  average  of  the  four 
definitions.  The  resulting  values  of  R( 12)  are  shown  in  Table  3. 


Insert  Table  3  about  here 


It  is  clear  from  Table  3  that  the  different  definitions  yield  very  similar 
scalp  distributions,  and  that  each  distribution  is  very  similar  to  the 
average  distribution.  Thus,  it  appears  that  the  four  procedures  used  to 
estimate  the  scalp  distribution  of  P300  converge  on  the  same  solution, 
represented  by  the  average  definition. 

Discussion 

In  this  paper  we  have  seen  how  one  can  proceed  from  the  assumption  that 
ERP  components  are  characterized  by  particular  scalp  distributions,  to  the 
identification  and  quantification  of  ERP  components  In  an  observed  waveform. 
We  noted  that  this  assumption  is  held  by  many  researchers  in  the  field,  and 
that  it  appears  to  be  based  on  the  belief  that  two  different  scalp 
distributions  cannot  be  produced  by  the  same  neural  generator. 

Although  this  assumption  is  widespread,  it  has  generally  been  used  only 


as  a  qualitative  guide  to  component  Identification.  The  assumption  leads  to 
the  selection  of  particular  recording  locations.  For  instance,  studies  of 
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movement  related  potentials  generally  include  lateral  electrode  placements, 
while  those  of  P300  include  midline  montages.  The  assumption  also  often 
leads  to  the  selection  of  one  particular  scalp  site  for  the  analysis,  where 
the  component  is  supposed  to  be  maximally  evident. 

The  procedure  we  have  proposed  (Vector  filter)  allows  the  investigator 
to  take  advantage  of  the  quantitative  implications  of  the  assumption. 

Vector  filter  is  based  on  a  model  that  assumes  that  the  observed  ERP  is 
given  by  the  linear  combination  of  a  set  of  components.  Note  that  the 
following  assumptions  are  made:  (a)  the  set  of  components  is  limited,  (b) 
the  scalp  distribution  of  each  component  is  known  and  fixed,  and  (c)  the 
components  combine  in  a  linear  fashion  (without  interaction)  in  determining 
the  composite  ERP.  We  have  shown  how  investigators  can  choose  the  set  of 
components,  and  we  have  discussed  the  consequences  of  choosing  the  incorrect 
set.  We  have  also  considered  ways  in  which  the  scalp  distribution  of  a 
particular  component  can  be  established,  and  we  have  illustrated  how  four 
different  ways  of  defining  the  scalp  distribution  of  P300  converge.  We  have 
not  addressed  the  validity  of  the  assumption  of  linear  combination. 

However,  it  should  be  noted  that  this  assumption  is  common  to  most 
decomposition  techniques  (cf.  PCA,  waveform  subtraction). 

There  are  two  ways  in  which  Vector  filter  can  be  useful.  First,  Vector 
filter  can  provide  a  decomposition  of  the  ERP  into  components.  This  use  of 
the  procedure  requires  that  the  three  assumptions  reviewed  in  the  previous 
paragraph  apply.  Violation  of  these  assumptions  will  lead  to  a 
misrepresentation  of  the  component  structure.  Note  that  one  can  test  the 
degree  to  which  the  chosen  set  of  components  account  for  the  observed  data 
using  statistical  tests.  If  these  tests  reveal  that  the  model 
satisfactori ly  accounts  for  the  data,  then  the  model  and  its  assumptions  are 
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supported.  Note  however  that  "a  good  fit"  does  not  exclude  the  possibility 
that  other  models  might  apply. 

A  second  use  of  Vector  filter  is  to  enhance  the  signal  (component  of 
interest)  with  respect  to  other  aspects  of  the  ERP  waveform.  In  this  case, 
the  procedure  is  used  to  "filter  for"  the  scalp  distribution  of  a  particular 
component,  or  to  "filter  out"  the  contribution  of  components  or  noise  with 
different  scalp  distributions.  In  this  case,  it  is  not  necessary  for  the 
assumptions  that  underlie  the  procedure  to  be  completely  valid.  Just  as 
with  frequency  filtering  it  is  not  necessary  to  know  the  exact  frequency  of 
the  signal  of  interest  (and  of  the  noise),  so  with  Vector  filter  it  is  not 
necessary  to  know  the  exact  scalp  distribution  of  the  component  of  interest 
(and  of  the  noise).  (An  Investigation  of  the  effects  of  imprecision  in  the 
estimate  of  component  distribution  is  presented  In  a  companion  paper, 
Gratton,  Kramer,  Coles,  &  Oonchin,  submitted.) 

Both  these  uses  of  Vector  filter  represent  solutions  to  the  problem  of 
isolating  overlapping  components.  The  advantage  of  the  procedure  over  other 
methods  (e.g.,  PCA,  waveform  subtraction)  is  that  it  does  not  require  that 
the  latency  and  timecourse  of  the  components  are  constant  over  different 
trials,  subjects,  or  experimental  conditions.  Thus,  Vector  filter  appears 
to  be  particularly  suitable  for  studies  that  Involve  latency  variability. 

We  have  evaluated  the  use  of  Vector  filter  to  improve  the 
signal-to-noise  ratio  of  the  P300  component  in  two  studies.  In  the  first 
study  (Gratton  et  al.,  submitted)  we  simulated  complex  ERPs  by  adding 
several  different  components  and  noise  to  a  P300.  In  this  study  we  found 
that  Vector  filter  helps  In  the  detection  of  P300,  by  improving  the 
estimation  of  P300  latency  by  approximately  25*.  In  a  second  study 
(Fabiani,  Gratton,  Karls,  &  Oonchin,  in  press)  we  measured  the  reliability 
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of  P300  amplitude  and  latency  estimates  taken  with  several  different 
procedures  on  observed  data.  In  these  studies  we  found  that  the  best 
results  are  obtained  when  Vector  filter  is  used  in  association  with 
time-domain  (peak-picking  and  cross-correlation)  procedures  for  the 
identification  of  P300. 

The  examples  of  the  use  of  Vector  filter  reviewed  above  show  how  scalp 
distribution  information  can  be  used  in  the  analysis  of  ERP  waveforms, 
despite  the  limited  knowledge  available  at  present  about  the  sources  of 
components.  It  is  clear,  however,  that  as  this  knowledge  accumulates,  so 
the  procedure  will  become  more  and  more  reliable. 
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Footnotes 

1.  The  term  "Vector  filter"  was  originally  used  to  refer  to  the 
particular  geometrical  properties  of  this  procedure  when  a  multivariate 
description  of  ERP  scalp  distribution  data  is  used.  However,  in  the  present 
paper,  we  adopted  an  alternative  approach  to  the  description  of  the 
procedure.  We  believe  that  this  non-geometric  approach  is  easier  both  for 
us  to  present  and  for  the  reader  to  follow.  However,  for  historical 
reasons,  we  have  chosen  to  maintain  the  label  "Vector  filter." 

2.  For  example,  the  Contingent  Negative  Variation  (CNV)  is  described  as 
"largest  at  the  vertex  ...  Amplitudes  also  show  a  progressive  diminution 
anteriorly  and  posteriorly  from  central  regions  ..."  (Donchin  et  al.,  1978, 
pp.  357-358).  The  P300  “is  characterized  by  its  scalp  distribution  as  it 
tends  to  be  larger  at  central  and  parietal  electrodes"  (Donchin  et  al., 

1978,  p.  356).  The  N400  "has  a  marked  posterior  distribution  over  the 
scalp,  with  a  slight  but  consistent  right  hemispheric  predominance  ..." 
(Kutas,  lindamood,  &  Hillyard,  1984,  p.  219).  Naatanen  and  Picton  (1987) 
use  scalp  distribution  and  other  procedures  to  distinguish  six  components  of 
the  auditory  Nl. 

3.  In  this  example  we  used  three  electrodes  (Fz,  Cz,  and  Pz),  but  the 
number  of  electrode  locations  used  can  be  any  number  larger  than  1. 
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Table  1.  Expanded  Model 


Voltages 

Contribution  of  each  component  (A,B,...,Z) 

Error  term 

observed 

to  the  voltage  observed  at  each  electrode. 

for  each 

at  each 

given  by  the  product  of  the  coefficients  (c) 

electrode 

electrode 

of  the  component  and  of  its  weight  for  that 

electrode  (A,  B,  ...  Z) 

Volt  (1)  =  c(A)  X  A( 1)  +  c(B)  x  B( 1)  +  ....  +  c(Z)  X  Z(l)  +  Err(l) 
Volt  (2)  =  c (A)  x  A( 2)  +  c(B)  x  B(2)  +  ....  +  c(Z)  x  Z(2)  +  Err(2) 
Volt  (3)  =  c(A)  x  A(3)  +  c(B)  x  B(3)  +  ....  +  c(Z)  x  Z(3)  +  Err(3) 


Volt  (N) 


•••  ••••  •••  •  •  • 
c(A)  x  A(N)  +  c(B)  x  B(N)  +  ....  +  c(Z)  x  Z(N)  +  Err(N) 


Notes:  Vo 1 t ( 1  to  N)  are  the  observed  values  at  electrode  (1  to  N); 

A(1  to  N),  B(1  to  N),  etc.  are  the  weights  of  the  spatial  filters, 
corresponding  to  components  (A  to  Z),  at  electrode  (1  to  N)  - 
these  weights  are  assumed  to  be  known; 

c(A  to  Z)  are  the  coefficients  that  are  estimated  by  the  least 
squares  procedure  that  minimizes  the  sum  of  the  squared  errors 
(Err(l  to  N))  for  each  electrode. 
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Table  2.  Mean  P300  amplitude  at  each  electrode  (microvolts) 


Definition 

Fz 

Cz 

Pz 

Target  Effect/Audi tory 

0.1 

0.8 

0.7 

Probability  Effect/Auditory 

2.8 

5.1 

4.3 

Omitted  Stimulus/Auditory 

-1.5 

4.1 

6.9 

Target  &  Prob.  Effects/Visual 

1.2 

2.6 

3.1 

Average 


0.65 


3.15 


3.75 
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Table  3.  R ( 12s )  of  the  scalp  distributions  of  different  definitions  of  P300 


Definition  TE/A  PE/A  OS/A  TPE/V 


Target  Effect/Auditory 

Probability  Effect/Auditory  .957 

Omitted  Stimulus/Auditory  .918  .795 

Target  &  Prob.  Effects/Visual  .972  .990  .907 

Average 


.986  .956  .945  .993 
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Figure  Legends 

Fig  1.  First  row:  Scalp  distribution  of  two  hypothetical  ERP 
components  (Components  A  and  B).  Second  row:  Time  course  of  Components  A 
and  B.  Third  row:  Composite  ERP  waveform  given  by  the  sum  of  Components  A 
and  B.  Fourth  row:  Time  course  of  Components  A  and  B  as  retrieved  by  the 
Vector  filter  procedure. 

Fig  2.  First  row:  Scalp  distribution  of  two  hypothetical  ERP 
components  (Components  A  and  B).  Second  row:  Time  course  of  Components  A 
and  B.  Third  row:  Composite  ERP  waveform  given  by  the  sum  of  Components  A 
and  B.  Fourth  row:  Time  course  of  Component  A  as  retrieved  by  the  Vector 
filter  procedure  when  only  Component  A  is  filtered  for. 

Fig  3.  First  row:  Scalp  distribution  of  three  hypothetical  ERP 
components  (Components  A,  B,  and  C).  Second  row:  Time  course  of  Components 
A,  B,  and  C.  Third  row:  Composite  ERP  waveform  given  by  the  sum  of 
Components  A,  B,  and  C.  Fourth  row:  Time  course  of  Components  A  and  B  as 
retrieved  by  the  Vector  filter  procedure. 

Fig  4.  From  Miller  et  al.  (in  preparation).  Grand  average  waveforms 
at  Fz  (solid),  Cz  (dashed),  and  Pz  (dotted)  for  each  task  and  stimulus 
( n= 10) . 
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Abstract 

We  compared  the  accuracy  of  P30Q  latency  estimates  obtained  with 
different  procedures  under  several  simulated  signal  and  noise  conditions. 
Both  preparatory  and  signal  detection  techniques  were  used.  Preparatory 
techniques  included  frequency  filters  and  spatial  filters  (single  electrode 
selection  and  Vector  filter).  Signal  detection  techniques  included 
peak-picking,  cross-correlation,  and  Woody  filter.  Accuracy  in  the  latency 
estimation  increased  exponentially  as  a  function  of  the  slgnal-to-nolse 
ratio.  Both  Woody  filter  and  cross-correlation  provided  better  estimates 
than  peak-picking,  although  this  advantage  was  reduced  by  frequency 
filtering.  For  all  signal  detection  techniques.  Vector  filter  provided 
better  estimates  than  single  electrode  selection.  Large  component  overlap 
impaired  the  accuracy  of  the  estimates  obtained  with  single  electrode 
selection,  but  it  impaired  the  accuracy  of  the  estimates  obtained  with 
Vector  filter  only  when  the  overlapping  component  had  a  scalp  distribution 
similar  to  that  of  the  signal  component.  The  effects  of  varying  noise 
characteristics,  P300  duration  and  latency,  as  well  as  the  parameters  of 
Vector  filter  were  also  Investigated. 

Descriptors:  P300  Latency  Measures,  Simulations,  Frequency  Filters,  Spatial 
Filters,  Vector  Filter,  Peak-picking,  Cross-correlation,  Woody  Filter. 
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In  this  paper  we  compare  several  of  the  techniques  that  are  commonly 
jsed  for  the  measurement  of  the  latency  of  components  of  the  Event-Related 
Brain  Potential  (ERP).  In  this  paper  we  will  focus  on  one  particular 
component,  the  P300.  Theoretical  speculation  (e.g.,  Coles  &  Gratton,  1985; 
Coles,  Gratton,  Bashore,  Eriksen,  &  Donchin,  1985;  Donchin,  1981)  and 
empirical  evidence  (e.g.,  Ouncan-Johnson  &  Donchin,  1982;  Kutas,  McCarthy,  & 
Donchin,  1977;  McCarthy  &  Donchin,  1981;  Ragot,  1984)  suggest  the  utility  of 
the  latency  of  P300  in  the  study  of  cognition.  However,  since  ERP 
components  are  estimated  from  data  containing  substantial  amounts  of  noise, 
the  measures  of  latency  necessarily  yield  approximations  to  the  “true" 
latency  of  the  components.  In  the  studies  reported  here,  we  simulated  ERP 
data  so  that  known  ERP  components  were  embedded  in  noise  of  different 
amounts  and  qualities.  The  degree  to  which  different  procedures  accurately 
estimated  the  known  parameters  of  the  ERP  components  was  then  assessed. 

The  need  for  such  a  comparison  is  evident.  While  the  Importance  of 
measuring  latencies  is  generally  acknowledged,  there  is  little  consensus 
regarding  the  methods  to  be  used  (see  Fabiani,  Gratton,  Karis,  &  Donchin,  In 
press,  for  an  overview  of  different  procedures  used  for  the  definition  of 
one  particular  ERP  component  -  the  P300).  Seemingly  conflicting  results 
appear,  and  the  conflict  may  be  due  entirely  to  the  incommensurability  of 
the  methods.  Thus,  our  results  can  be  used  as  a  source  of  guidelines  for 
the  choice  of  an  appropriate  procedure  for  the  estimation  of  component 
latency.  Of  course,  such  guidelines  are  confined  to  the  domain  explored  by 
our  studies. 

A  procedure  designed  to  estimate  the  latency  of  an  ERP  component  should 


take  into  account  the  characteristics  of  both  the  signal  (i.e.,  the  ERP 
component  of  interest)  and  the  noise  (i.e.,  all  the  other  electrical 
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activity  recorded  by  the  scalp  electrode).  From  the  point  of  view  of  the 
ERP  researcher,  there  are  at  least  two  categories  of  noise.  The  electrodes 
record,  in  addition  to  the  component  of  interest,  the  background  EEG  (random 
noise)  and  the  activity  of  concurrent  ERP  components  (systematic  noise). 1 
(Systematic  noise  refers  to  the  fact  that  this  type  of  noise  does  not 
average  to  zero  across  a  large  number  of  trials.)  Previous  studies  that 
have  simulated  ERPs  have  focussed  on  the  effect  of  random  noise  (intended  to 
simulate  background  EEG)  on  the  accuracy  of  latency  estimates  (see 
Pfefferbaum,  1983;  Woody,  1967).  However,  a  thorough  evaluation  of  latency 
estimation  procedures  can  be  obtained  only  when  the  effects  of  both  random 
and  systematic  noise  are  examined. 

The  term  "background  EEG"  refers  to  the  electrical  brain  activity  which 
is  not  time-locked  to  the  external  event.  It  is  generally  assumed  that  the 
background  EEG  has  a  random  phase  in  relation  to  the  triggering  event. 
However,  this  activity  is  technically  not  random  (i.e.,  not  white  noise),  as 
some  frequency  bands  may  be  dominant.  The  presence  of  strong 
auto-correlation  functions  in  the  background  EEG  activity  may  affect  signal 
detection  procedures  which  are  based  on  the  auto-regressive  properties  of 
the  signal  (cf.  auto-correlation  procedures).  In  fact,  the  background  EEG 
activity  may  share  some  characteristics  with  the  signal  (for  instance, 
frequency).  Such  noise  may  lead  to  the  detection  of  a  component  when  there 
is  only  noise  present  (Pfefferbaum,  1983).  To  allow  the  sources  of  such 
"false  alarms"  to  play  a  role  in  the  simulation,  the  frequency 
characteristics  of  the  background  EEG  must  be  reproduced  in  the  simulated 
waveforms. 

Unfortunately,  the  characteristics  of  the  background  EEG  activity 
occurring  during  an  ERP  experiment  are  not  well  known.  Several  studies  have 


Simulation  studies  of  latency  measures 


6 


concurrently  examined  ERP  and  EEG  frequency  power  spectra  (for  example,  see 
McCarthy  g,  Donchin,  1978).  However,  the  presence  of  ERPs  makes  the 
frequency  power  spectra  that  are  obtained  with  such  procedures  poor 
estimates  of  the  background  EEG  activity  (see  Ungar  &  Basar,  1976,  for  a 
related  discussion  on  the  utility  of  “Wiener  filter,"  Wiener,  1949).  In  our 
studies,  we  simulated  the  background  noise  by  using  the  deviation  of  single- 
trial  ERPs  from  the  average  ERP. 

The  second  source  of  noise  is  provided  by  those  ERP  components  that 
temporally  and  spatially  overlap  the  target  component.  In  this  case,  since 
ERP  components  are  believed  to  be  invariant  across  trials,  the  average  value 
_  of  the  noise  over  a  large  number  of  trials  represents  an  estimate  of  the 
overlapping  component(s) .  Note  that  this  value  Is  not  equal  to  zero  as  Is 
the  case  with  averages  of  random  noise.  The  error  induced  by  such 
systematic  noise  is  particularly  Insidious  because  it  may  vary  as  a  function 
of  experimental  manipulations.  Furthermore,  latency  detection  procedures 
might  be  differentially  affected  by  overlapping  components.  To  study  the 
effects  of  overlapping  components  on  the  accuracy  of  latency  estimates,  we 
added  a  set  of  components  to  the  waveforms.  The  components  varied  in  the 
degree  of  temporal  and  spatial  overlap  with  the  target  component.  The 
amplitude  and  latency  of  the  components  were  also  systematically  varied. 

The  relative  amplitudes  of  the  component  (signal)  and  of  the  noise  are 
important  in  determining  the  accuracy  of  the  detection  of  the  component. 
Several  studies  have  demonstrated  that  detection  accuracy  Increases 
monotonically  with  Increases  of  the  signal-to-nolse  ratio  (Nahvl,  Woody, 
Ungar,  &  Sharafat,  1975;  Pfefferbaum,  1983;  Wastell,  1977;  Woody,  1967). 
However,  different  procedures  may  be  differentially  affected  by  the  same 
increase  in  the  signal-to-nolse  ratio,  and  a  procedure  that  is  more  accurate 
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at  one  s ignal-to-noi se  ratio  may  be  less  accurate  at  another  (see  a  related 
discussion  about  Wiener  filter  in  Wastell,  1981). 

In  our  studies,  we  distinguish  between  two  classes  of  techniques  which 
are  used  in  the  estimation  of  the  latency  of  ERP  components:  (a) 
preparation,  or  filtering,  procedures,  and  (b)  signal  detection  procedures. 
The  purpose  of  the  filtering  techniques  is  to  increase  the  signal-to-noise 
ratio  by  capital i 2 i ng  on  the  differences  in  the  physical  properties  of  the 
signal  and  of  the  noise.  These  techniques  prepare  the  data  for  the  latency 
estimation,  that  is  carried  out  by  the  signal  detection  procedures.  We  will 
consider  two  kinds  of  filtering  techniques,  frequency  filters  and  spatial 
filters. 

The  function  of  frequency  filters  is  to  eliminate  electrical  activity 
of  undesirable  frequencies.  However,  the  effect  of  frequency  filters  on 
latency  estimates  has  not  been  thoroughly  explored.  Both  on-line  analog  and 
off-line  digital  filters  are  used  (for  a  discussion  of  both  kinds  of  filters 
see  Coles,  Gratton,  Kramer,  &  Miller,  1986).  On-line  analog  filters 
generally  introduce  phase  shifts  which  result  in  distortions  of  the  latency 
estimates.  The  magnitude  of  the  phase  distortion  depends  on  the  band-pass 
characteristics  of  the  filter  (see  Duncan- Johnson  and  Donchin,  1977,  for  a 
discussion  of  the  effect  of  high-pass  filters  on  ERP  waveforms).  For  this 
reason  most  researchers  use  broad-band  filters  in  the  collection  of  ERP 
data.  Off-line  digital  filters  can  be  designed  in  such  a  way  as  to  avoid 
the  introduction  of  phase  shifts.  Such  filters  may  be  used  during  the 
preparation  of  the  ERP  data  for  signal  detection  techniques.  In  our 
studies,  we  will  focus  on  low-pass  off-line  digital  filters  with  no  phase 
distortion  (Ruchkin  &  Glaser,  1978). 

Scalp  distribution  information  has  not  often  been  used  in  the 
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preparation  of  data  for  latency  estimation  (see  Nahvi  et  al.,  1975,  for  an 
attempt  to  use  multielectrode  information  for  improving  signal  detection). 

In  general,  researchers  have  simply  selected  one  electrode  location  to  use 
for  further  analysis.  (We  will  label  this  procedure  "single  electrode 
selection").  One  of  the  goals  of  this  paper  is  to  evaluate  the  use  of  scalp 
distribution  information  to  improve  the  detection  of  a  component  at 
different  signal-to-noise  ratios.  Scalp  distribution  information  can  be 
used  by  adopting  a  procedure  (Vector  filter),  described  by  Gratton,  Coles, 
and  Oonchin  (submitted). 2  Vector  filter  assumes  that  scalp  distribution  Is 
a  defining  characteristic  of  an  ERP  component.  Therefore  information  about 
scalp  distribution  can  be  employed  to  discriminate  among  ERP  components  and 
between  ERP  components  and  noise.  We  will  compare  the  accuracy  of  the 
latency  estimates  obtained  from  data  prepared  with  Vector  filter  with  that 
of  those  obtained  by  selecting  a  single  electrode.  Vector  filter  can  be 
thought  of  as  a  linear  filter  where  the  weights  for  each  electrode  are 
chosen  to  optimize  the  detection  of  the  target  component.  Single  electrode 
selection  can  be  thought  of  as  a  linear  filter  with  a  weight  of  1  assigned 
to  the  selected  electrode,  and  a  weight  of  0  assigned  to  all  the  other 
electrodes. 

The  purpose  of  a  signal  detection  technique  Is  to  detect  the  signal 
under  noisy  conditions  (for  a  review  of  signal  detection  techniques  in  ERP 
research,  see  Coles  et  al.,  1986).  Two  types  of  signal  detection  techniques 
are  commonly  used  by  ERP  researchers  in  the  study  of  a  component's  latency: 
peak-picking  and  cross-correlation.  These  techniques  differ  in  the  way  a 
signal  is  defined.  Peak-picking  identifies  a  con^onent  as  the  maximum  or 
minimum  value  in  a  certain  time  window.  Note  that  only  the  time-point  at 
which  the  peak  occurs  is  used  to  estimate  the  amplitude  and  latency  of  the 
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component.  Cross-correlation  techniques  define  a  component  in  terms  of  Its 
waveshape  (Derbyshire,  Driessen,  &  Palmer,  1967;  Palmer,  Derbyshire,  &  Lee, 
1966).  They  consider  a  component  as  that  segment  of  a  waveform  whose  shape 
maximally  "resembles"  an  externally  defined  segment  of  waveform,  labelled 
the  "template."  "Resemblance"  is  assessed  by  means  of  a  correlation  or 
covariance  measure.  Component  latency  is  defined  as  the  "lag"  between  the 
template  and  the  waveform  necessary  to  produce  the  maximum  resemblance. 

Woody  (1967)  proposed  a  particular  variant  of  the  cross-correlation 
procedure,  where  the  template  is  "adapted"  to  the  average  waveform  through 
several  iterations  (Woody  filter). 3  Since  filtering  affects  the 
signal-to-noise  ratio,  its  impact  on  different  signal  detection  procedures 
will  also  be  considered  in  the  present  paper. 

There  were  three  phases  to  our  studies.  First,  simulated  waveforms 
were  generated  according  to  a  particular  model  of  ERPs.  Second,  procedures 
were  applied  to  these  waveforms  to  obtain  latency  estimates.  These 
procedures  included  filtering  and  signal  detection  techniques.  Finally,  the 
accuracy  of  the  latency  estimates  was  computed,  and  the  merits  of  each 
procedure  were  evaluated. 

We  should  note  the  differences  between  the  basic  design  (labelled 
Simulation  1)  and  the  control  conditions  we  ran  to  Investigate  particular 
problems  (labelled  Simulation  2,  3,  4,  5,  and  6). 

Simulation  1:  Basic  Design 

In  this  section  we  d'scuss  the  results  obtained  from  the  basic  design. 
We  shall  present  a  series  of  additional  analyses  we  ran  to  investigate  the 
effect  of  non-standard  1  zed  background  noise,  variations  of  P300  duration  and 
latency,  and  variations  of  the  parameters  of  Vector  filter,  on  the  accuracy 
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of  the  latency  estimates  in  later  sections. 

Method 

The  basic  design  consisted  of  a  factorial  combination  of  several 
simulated  conditions  and  analytic  procedures.  The  simulated  conditions 
yielded  220  different  conditions  (11  P300  amplitude  levels  x  20  overlapping 
component  conditions).  For  each  condition  we  obtained  100  repetitions  by 
adding  background  EEG  noise  (to  be  described  in  a  subsequent  section), 
sampled  at  random  and  with  reselection  from  a  set  of  100  background  noise 
trials  (note  that  the  relationship  between  waveform  and  recording  electrode 
-  Fz ,  Cz,  and  Pz  -  was  maintained).  This  procedure  was  repeated  five  times, 
once  for  each  of  the  five  filtering  conditions  (four  filter  settings  and  one 
control  condition  where  no  filter  was  used).  This  produced  a  total  of 
110,000  waveforms  for  each  of  the  three  electrodes.  Each  of  these  sets  of 
waveforms  was  filtered  spatially  using  either  single  electrode  selection  or 
Vector  filter.  Finally,  we  applied  four  signal  detection  algorithms 
(peak-picking,  cross-correlation,  Woody  filter  with  2  iterations,  and  Woody 
filter  with  3  iterations)  to  obtain  latency  estimates  for  each  of  the  2  x 
110,000  waveforms. 

Note  that  evaluation  of  the  different  signal-detection  algorithms  and 
of  the  different  spatial  filtering  techniques  was  based  on  their  performance 
for  the  same  set  of  waveforms.  However,  comparisons  among  different 
frequency  filtering  procedures,  overlap  conditions,  and  P300  amplitudes  were 
based  on  different  waveforms,  obtained  by  sampling  independently  from  the 
set  of  background  noise  trials.  A  list  of  the  experimental  conditions  for 
the  basic  design  is  shown  in  Table  1. 
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Insert  Table  1  about  here 


Model .  Our  simulations  were  based  on  the  following  model: 

Volt ( it)  =  cP300(t)  x  P300(i )  +  SUM  (cOVER(jt)  x  OVER(ij))  +  N(it) 
where: 

Volt(it)  is  the  potential  recorded  at  electrode  i  at  time  t; 

cP300(t)  is  the  amplitude  of  the  P300  component  at  time  t; 

P300 ( i )  is  the  weight  of  the  P300  component  at  electrode  i; 

cOVER (jt)  is  the  amplitude  of  the  overlapping  component  j  at  time  t; 

OVER(ij)  is  the  weight  of  the  overlapping  component  j  at  electrode  1; 

N(it)  is  the  background  EEG  noise  at  electrode  i  at  time  t. 

In  adopting  this  model,  we  assume  that  each  component  is  characterized 
by  a  particular  scalp  distribution,  defined  by  a  series  of  weights,  one  for 
each  electrode  location.  This  assumption  is  similar  to  that  used  in  the 
Vector  filter  procedure  (Gratton  et  al . ,  submitted,  and  see  Footnote  2). 

Single  trials.  Each  assessment  of  the  accuracy  of  the  latency 
estimates  was  based  on  100  repetitions,  at  three  different  electrode 
locations  (labelled  Fz,  Cz,  and  Pz).  Each  repetition  was  called  a  "trial" 
and  was  obtained  by  adding  several  time  series  (vectors)  point  by  point. 

The  trials  were  constructed  by  adding  a  different  noise  vector  to  each  of 
100  identical  component  vectors.  The  vectors  consisted  of  128  data  points, 
considered  to  have  been  recorded  at  100  Hz  digitizing  rate,  starting  200 
msec  before  a  hypothetical  stimulus.  The  average  of  the  first  20  points  was 
considered  as  an  estimate  of  the  "pre-stimulus"  baseline  level  and  was 
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subtracted  from  the  data. 

Simulation  of  ERP  components.  One  target  and  four  non-target 
components  were  obtained  by  adding  together  five  cosinusoidal  waves.  The 
amplitude,  latency,  and  duration  (wavelength)  of  the  cosinusoidal  waves 
could  vary,  and  each  component  was  simulated  by  using  different  parameters. 
The  scalp  distribution  of  each  component  was  simulated  by  multiplying  the 
vector  by  a  different  scaling  factor  for  each  electrode.  The  target 
component  simulated  the  "P300"  component,  and  the  non-target  components 
simulated  the  "N100",  "P200",  "N200",  and  "Slow  Wave"  components.  The 
parameters  (amplitude,  latency,  duration,  and  scalp  distribution)  of  each 
component  do  not  correspond  to  data  obtained  in  a  particular  experiment. 
However,  an  attempt  was  made  to  reproduce  the  parameters  of  the  components 
as  described  in  the  ERP  literature  (see  Donchin,  Ritter,  &  McCallum,  1978). 
The  parameters  of  N100  and  P200  were  not  varied  systematically,  since  they 
do  not  overlap  temporally  with  P300  in  our  data  set,  and,  therefore,  should 
not  affect  the  P300  latency  estimates.  The  amplitude  and  latency  of  N200 
and  Slow  Wave  were  systematically  varied  to  simulate  the  different  degrees 
of  overlap  with  P300  that  may  be  present  in  observed  data.  A  control 
condition  in  which  only  the  P300  component  was  present  was  also  included  in 
the  study.  The  parameters  adopted  for  each  component  are  presented  In  Table 
2. 


Insert  Table  2  about  here 


P300  amplitude  was  varied  systematically  from  0  (absence  of  the 


component)  to  500  units,  with  Increments  of  50  units.  Each  unit  was 
intended  to  be  equivalent  to  0.1  microvolts,  so  that  P300  amplitude  varied 
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from  0  to  50  microvolts.  This  manipulation  of  P300  amplitude  allowed  us  to 
evaluate  the  different  procedures  over  a  wide  range  of  signal  amplitude 
conditions.  Given  the  complex  procedure  we  adopted  to  simulate  noise 
(described  below),  and  in  particular  the  presence  of  systematic  noise,  we 
could  not  express  the  true  signal-to-nolse  ratio  as  an  absolute  value. 
However,  we  could  compute  the  ratio  between  the  amplitude  of  the  signal  and 
the  root  mean  square  amplitude  of  the  background  EEG  noise  that  was  fixed  at 
100.  We  chose  to  label  this  value  "signal-to-noise  ratio."  (Strictly 
speaking  the  value  should  be  referred  to  as  the  "signal-to-background-nolse 
ratio".  For  reasons  discussed  earlier,  we  do  not  Include  systematic  noise, 
or  component  overlap.  In  this  estimate.)  This  ratio  varied  systematically 
from  0  to  5,  in  half  unit  Increments.  Both  N200  and  Slow  Wave  partially 
overlapped  with  P300.  N200  had  a  frontally  maximum  scalp  distribution  that 

was  clearly  different  from  that  of  P300.  The  scalp  distribution  of  the  Slow 
Wave  was  maximally  positive  at  Pz,  and  was  therefore  more  similar  to  that  of 
P300.  These  manipulations  allowed  us  to  study  the  impact  of  different 
levels  of  component  overlap  (using  manipulations  of  latency,  amplitude,  and 
scalp  distribution)  on  the  P300  latency  estimates.  The  five  components  were 
added  to  obtain  complex  ERP  waveforms.  An  example  of  one  set  of  these 
waveforms  is  presented  in  the  upper  panel  of  Figure  1. 


Insert  Figure  1  About  Here 


Component  amplitudes  and  latencies  were  factorlally  combined.  The 
design  Included  2x2  (amplitude  and  latency)  N200  manipulations,  2x2  Slow 
Wave  manipulations,  and  four  repetitions  of  the  control  condition  (where  the 
amplitudes  of  the  overlapping  component  were  zero),  with  a  total  of  20 
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conditions  of  component  overlap.  As  there  were  11  levels  of  P300  amplitude, 
220  basic  ERP  waveforms  were  obtained  for  each  electrode. 

Background  EEG  noise.  Estimates  of  background  EEG  activity  were 
obtained  from  a  set  of  100  trials  recorded  from  an  individual  subject  in  an 
oddball  experiment  (Fabiani  et  al.,  in  press). 4  The  background  EEG  was 
obtained  by  subtracting  the  appropriate  average  from  each  single-trial 
record.  A  power  spectral  analysis  of  these  waveforms  revealed  maxima  in  the 
following  bands:  0-4  Hz  for  all  electrodes,  8-10  Hz  for  the  parietal 
electrode,  and  14-16  Hz  for  all  electrodes. 

This  procedure  yielded  a  set  of  waveforms  whose  average  is  a  flat  line. 
However,  the  variability  from  trial  to  trial  Is  not  equal  for  all 
time-points  and  electrodes.  In  particular,  larger  Intertrial  variance  was 
observed  at  a  latency  of  approximately  300  msec  (perhaps  due  to  sampling 
variation),  and  smaller  variance  during  the  pre-stimulus  period.  Under 
these  conditions,  these  waveforms  could  not  be  considered  stationary  over 
the  whole  epoch,  and,  therefore  they  could  not  be  considered  as  good 
estimates  of  the  background  EEG  noise.  A  further  disadvantage  was  that  the 
impact  of  noise  could  vary  as  a  function  of  the  latency.  Therefore  we  chose 
to  standardize  each  time-point  (and  electrode),  with  a  mean  of  0,  and  a 
standard  deviation  of  100  units  (10  microvolts).  We  considered  the 
resulting  100  waveforms  for  each  electrode  as  our  simulated  background  EEG 
activity.  An  analysis  of  the  frequency  characteristics  of  the  simulated 
background  noise  after  the  standardization  did  not  reveal  any  difference 
from  the  "non-standardlzed"  noise.  However,  we  also  ran  a  control  condition 
with  non-standardlzed  waveforms  (Simulation  2). 

An  example  of  single-trial  waveforms  obtained  by  adding  the  complex  ERP 
waveform  and  the  standardized  background  noise  Is  shown  in  the  lower  panel 
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of  Figure  1.  Note  that  the  waveforms  obtained  by  adding  the  noise  are 
different  in  several  aspects  from  those  not  containing  noise.  In  particular 
the  relationship  among  electrodes  is  altered,  double  peaks  are  noticeable, 
etc. 

Off-1 ine  Frequency  Filters.  The  study  included  a  comparison  among  four 
off-line,  low-pass  frequency  filtering  procedures,  and  a  no  filtering 
condition.  All  the  filters  were  based  on  a  moving  average  (Ruchkln  & 

Glaser,  1978).  The  procedure  differed  in  the  number  of  consecutive 
time-points  used  for  the  smoothing  (length),  and  in  the  number  of  iterations 
of  the  procedure  adopted.  Two  length  levels  (7  and  13  points),  and  two 
iteration  levels  (1  and  2  iterations)  were  used.  Moving  average  filters 
cannot  be  perfectly  described  in  terms  of  the  ha If -amplitude  cut-off 
frequencies  (Ruchkin  &  Glaser,  1978).  However,  a  running  average  of  7 
points  corresponds  roughly  to  a  -3  dB  cut-off  point  at  6.29  Hz  when  one 
iteration  is  used,  and  at  4.43  Hz  when  two  Iterations  are  used.  For  13 
consecutive  points  the  corresponding  values  are  3.38  Hz  and  2.38  Hz, 
respectively. 

We  should  emphasize  that  the  comparison  between  filtering  procedures 
described  above  does  not  exhaust  all  off-line  frequency  filters  available  to 
the  investigator.  However,  Simulation  5  considers  the  impact  of  a  wider 
range  of  less  conroonly  used  filters  (up  to  1.42  Hz). 

Spatial  Filters.  Two  types  of  spatial  filters  were  compared:  single 
electrode  selection  and  Vector  filter. 

a.  Single  electrode  selection.  This  procedure  consists  of  the 
selection  of  one  electrode  for  further  analysis.  Given  that  our  P300 
component  was  maximum  at  Pz,  we  chose  this  electrode  for  the  analysis. 

b.  Vector  Filter.  This  procedure,  described  by  Gratton  et  al . 
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(submitted),  is  based  on  the  assumption  that  ERP  components  are 
characterized  by  a  specific  scalp  distribution  that  can  be  expressed  by  a 
series  of  weights,  one  for  each  electrode. 

In  the  present  case,  we  used  a  Vector  filter  characterized  by  the 
following  weights:  0.15  for  Fz,  -0.53  for  Cz,  and  0.83  for  Pz.  The  choice 
of  these  weights  was  based  on  a  previous  study  in  which  they  were  found  to 
produce  on  optimal  discrimination  between  two  groups  of  rare  and  frequent 
trials.  As  we  have  noted  elsewhere  (Gratton  et  al.,  submitted),  component 
weights  can  be  estimated  by  using  discriminant  analysis  to  determine  the 
optimal  discrimination  between  a  set  of  waveforms  In  which  the  component  of 
interest  is  present  and  another  set  in  which  it  is  absent  (or  smaller). 

Since  the  P300  component  is  larger  for  rare  than  for  frequent  stimuli  (e.g., 
Duncan-Johnson  &  Donchin,  1977),  the  weights  derived  from  the  discriminant 
function  procedure  should  represent  the  P300  component.  Note  that  the  set 
of  weights  does  not  correspond  to  the  typical  P300  distribution.  This  is 
because  the  background  noise  is  not  "random."  Further  discussion  of  this 
point  is  provided  in  this  Simulation  6,  In  which  we  varied  parametrically 
the  weights  of  Vector  filter  to  determine  which  parameters  resulted  In  the 
best  improvement  in  the  estimation  of  P300  latency. 

Signal  detection  techniques.  The  signal  detection  algorithms  included 
peak-picking  and  cross-correlation  techniques.  The  peak-picking  algorithm 
is  based  on  the  detection  of  the  maximum  value  In  a  prespecified  temporal 
window. 

In  this  paper  we  adopted  a  procedure  which  allowed  us  to  evaluate  both 
cross-correlation  and  Woody  filter.  We  used  a  template  equivalent  to  the 
P300  component  that  we  used  to  generate  the  simulated  waveforms  for  the 
first  iteration  of  the  Woody  filter  procedure.  Thus,  this  first  iteration 
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corresponded  to  a  cross-correlation  algorithm,  while  the  subsequent 
iterations  corresponded  to  successive  iterations  of  the  Woody  filter.  The 
estimate  of  P300  latency  was  obtained  by  selecting  the  central  value  of  the 
ERP  segment  with  the  maximum  correlation  with  the  template. 

For  each  signal  detection  procedure,  the  temporal  window  began  300  msec 
post-stimulus  and  ended  800  msec  post-stimulus  (250  msec  before  and  250  msec 
after  the  P300  peak).  The  duration  of  the  template  used  for  the 
cross-correlation  algorithm  was  500  msec. 

Accuracy  estimation.  As  described  above,  100  repetitions  were  obtained 
for  each  condition.  To  assess  the  accuracy  of  latency  estimation  obtained 
with  each  procedure  under  each  condition,  the  root  mean  square  error  (MSE) 
value  was  calculated.  This  value  was  obtained  as  follows: 

n  2 

MSE  =  SQRT (sum  (1  -  L)  /  n 
i=l  i 


where: 

MSE  is  the  root  mean  square  error  of  the  latency  estimates; 

n  is  the  number  of  trials; 

1  is  the  latency  estimate  at  trial  i; 
i 

L  is  the  P300  peak  latency  (550  msec). 

Most  of  the  figures  presented  in  the  following  sections  of  this  paper  show 
variations  of  the  MSE  value  as  a  function  of  variations  of  the 
signal-to-noise  ratio  (l.e.,  P300  amplitude).  In  the  plots  presented  here, 
a  logarithmic  scale  Is  used,  because  we  assumed  that  the  variability  In  MSE 
obtained  with  different  procedures  was  proportional  to  its  absolute  value. 

Another  dependent  variable  we  used  was  an  approximate  estimate  of  the 
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number  of  trials  required  to  reduce  the  standard  error  of  estimate  to  3 

msec.  5 

Results 

Non-overlapping  component  condition.  The  basic  design  was  devised  to 
permit  a  comparison  of  the  accuracy  of  several  procedures  over  a  wide 
variety  of  signal  and  noise  conditions.  As  a  reference  point  we  will  first 
present  the  data  obtained  in  the  condition  in  which  no  overlapping 
components  were  present.  The  MSE  values  (averaged  across  400  repetitions) 
for  this  condition  are  shown  in  Figure  2. 


Insert  Figure  2  About  Here 


As  a  reminder,  in  this  and  most  of  the  following  figures  the  abscissa 
represents  the  signal-to-noise  ratio  (i.e.  P300  amplitude),  while  the 
ordinate  represents  the  log  MSE.  The  MSE  values  obtained  with  the  Woody 
filter  were  similar  to  those  obtained  with  the  cross-correlation  procedure. 

Several  important  effects  are  apparent  in  Figure  2.  First,  variations 
of  the  signal-to-noise  ratio  produced  the  largest  effects  on  the  accuracy  of 
estimation.  As  expected,  at  a  signal-to-noise  ratio  of  0,  all  the 
procedures  produce  about  the  same  results.  The  M5E  at  this  signal-to-noise 
ratio  is  close  to  that  which  would  be  obtained  by  picking  points  at  random 
in  the  temporal  window.  In  fact,  the  log  MSE  obtained  in  the  latter  way  Is 
2.2.  By  increasing  the  signal-to-noise  ratio,  exponential  decreases  of  the 
MSE  can  be  observed  (the  functions  approximate  a  straight  line  in  the  figure 
because  of  the  logarithmic  scale  used  for  the  ordinate).  At  a 
signal-to-noise  ratio  of  5,  the  MSE  value  Is  one  tenth  of  that  found  at  a 
signal-to-noise  ratio  of  0.  Second,  the  use  of  cross-correlation  as  a 
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signal  detection  procedure  yielded  lower  MSE  values  than  peak-picking.  The 
gain  in  accuracy  obtained  with  cross-correlation  may  be  as  high  as  50%,  at 
very  high  signal -to-nolse  ratios,  and  when  no  frequency  filter  is  applied. 
Third,  the  use  of  Vector  filter  as  a  spatial  filtering  technique  reduced  the 
error  in  latency  estimation  in  comparison  with  single  electrode  selection 
(Pz).  This  advantage  is  evident  at  intermediate  and  high  signal-to-noise 
ratios.  At  low  or  intermediate  slgnal-to-nolse  ratios  (0.5  to  2.0),  the 
advantage  of  Vector  filter  Is  comparable  to  that  of  cross-correlation. 
However,  the  advantage  of  Vector  filter  rarely  reaches  the  50%  level  and  Is 
usually  about  25%.  The  advantages  of  Vector  filter  and  of  cross-correlation 
appear  to  be  independent.  Fourth,  low-pass  frequency  filters  produced 
marked  improvements  in  the  accuracy  of  latency  estimation.  The  largest 
improvement  was  obtained  with  a  2.38  Hz  low-pass  filter.  Low-pass  filters 
with  a  higher  cut-off  produced  less  improvement.  However,  this  effect  was 
particularly  evident  when  a  peak-picking  algorithm  was  used  for  signal 
detection.  The  gain  for  cross-correlation  was  small.  In  fact,  the  effect 
of  the  frequency  filters  was  to  bring  peak-picking  to  the  same  level  of 
accuracy  as  cross-correlation.  The  gain  obtained  with  Vector  filter  was 
unaffected,  and  in  fact  the  smallest  MSE  values  were  obtained  by  the  joint 
use  of  frequency  filters,  Vector  filter,  and  cross-correlation. 

Histograms  of  the  latency  estimates  for  each  single  trial  In  the 
non-overlapping  component  condition,  without  frequency  filter,  are  shown  in 

j 

Figure  3. 
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Insert  Figure  3  About  Here 


The  distribution  at  a  signal-to-noise  ratio  of  0  was  approximately 
rectangular,  indicating  that  no  point  was  more  likely  to  be  chosen  than  any 
other  when  no  signal  was  present  (apart  for  a  small  preference  for  the  first 
point  in  the  epoch).  At  higher  signal-to-noise  ratios,  central  values 
become  progressively  more  represented.  The  mode  tends  to  correspond  to  the 
actual  P300  latency.  An  exception  to  this  general  rule  can  be  observed  at  a 
signal-to-noise  ratio  of  2.5  for  the  peak-picking  algorithm  applied  to  Pz 
waveforms,  and  for  all  procedures  at  a  signal-to-noise  ratio  of  1,  where  the 
skewed  distribution  Indicates  the  presence  of  systematic  error. 

Latency-adjusted  average  waveforms  obtained  with  cross-correlation  and 
second  and  third  iterations  of  Woody  filter  for  the  non-overlapping 
component  condition,  at  extreme  levels  of  the  signal-to-noise  ratio  and  no 
frequency  filter,  are  shown  in  Figure  4. 


Insert  Figure  4  About  Here 


Inspection  of  this  figure  reveals  that,  even  when  no  ERP  component  Is 
present  (signal-to-noise  ratio  Is  equal  to  0),  the  latency  adjustment 
procedure  "creates"  one.  When  the  component  Is  large,  the  distortion 
produced  by  the  latency  adjustment  Is  negligible.  In  fact,  at  a 
signal-to-noise  ratio  of  5,  the  amplitude  of  P300  at  Pz  is  500  units,  while 
with  Vector  filter  It  Is  260  units  --  a  reduction  to  52*  of  Pz  amplitude 
that  can  be  predicted  on  the  basis  of  the  weights  used.  This  Interaction  of 
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the  performance  of  latency  adjustment  procedures  with  signal-to-noise  ratio 
is  in  agreement  with  the  results  reported  by  Pfefferbaum  (1983).  The 
artifactual  component  created  by  the  latency  adjustment  appears  larger  when 
the  signal  detection  algorithm  is  applied  to  Pz  waveforms,  than  for 
waveforms  obtained  with  Vector  filter.  This  effect  is  confounded  in  part 
with  the  overall  reduction  in  amplitude  produced  by  Vector  filter.  The 
artifactual  component  appears  also  to  have  the  same  amplitude  if  the  latency 
adjustment  is  obtained  after  the  cross-correlation  procedure.  Woody  filter 
with  one  iteration,  or  Woody  filter  with  two  iterations.  Thus,  the  problem 
does  not  seem  to  be  related  to  the  number  of  iterations,  but  rather  to  the 
latency  adjustment  procedure  per  se. 

Overlapping  component  conditions.  Log  MSE  (and  number  of  trials 
required  to  obtain  an  error  of  3  msec)  for  four  different  component  overlap 
conditions  with  two  spatial  filtering  and  two  signal  detection  procedures 
(without  frequency  filters)  are  shown  in  Figure  5. 


Insert  Figure  5  About  Here 


The  "no-component  overlap"  condition  Is  shown  In  the  upper  left  panel  for 
comparison.  The  other  three  conditions  shown  in  the  figure  were  "small 
component  overlap"  (N200  amplitude  =  50  units,  N20Q  latency  »  300  msec,  Slow 
Wave  amplitude  =  100  units.  Slow  Wave  latency  =  1280  msec),  "large  N200 
overlap"  (N200  amplitude  *  100  units,  N200  latency  =  400  msec.  Slow  Wave 
amplitude  =  100  units.  Slow  Wave  latency  =  1280  msec),  and  "large  Slow  Wave 
overlap"  (N200  amplitude  *  100  units,  N200  latency  *  300  msec.  Slow  Wave 
amplitude  =  200  units.  Slow  Wave  latency  =  1000  msec). 

Inspection  of  this  figure  reveals  that  component  overlap  impaired  the 
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accuracy  obtained  with  each  procedure  to  a  different  degree.  The  relative 
advantage  of  procedures  based  on  Vector  filter  in  comparison  to  those  based 
on  Pz  channel  selection  interacted  with  the  degree  and  type  of  component 
overlap.  In  particular,  the  advantage  was  smaller  for  the  "large  Slow  Wave 
overlap"  condition  than  for  the  other  conditions.  We  should  note  here  that 
the  scalp  distribution  of  Slow  Wave  was  close  to  that  of  P300,  while  the 
scalp  distribution  of  N200  was  quite  different.  Thus,  the  improvement 
obtained  with  Vector  filter  is  not  affected  by  overlapping  components  with  a 
scalp  distribution  different  from  that  of  P30Q,  but  is  affected  by  an 
overlapping  component  with  a  scalp  distribution  similar  to  that  of  P300. 
However,  even  in  the  worst  case  (large  Slow  Wave  overlap)  estimates  obtained 
on  Vector  filtered  data  are  no  worse  than  those  obtained  on  Pz  waveforms. 

For  reasons  of  space,  we  cannot  present  here  the  results  obtained  with 
all  the  other  combinations  of  component  overlap,  slgnal-to-noise  ratio, 
frequency  filtering,  spatial  filtering,  and  signal  detection  algorithm. 
However,  these  results  confirm  the  observations  we  have  presented  so  far. 
Discussion 

The  following  conclusions  can  be  drawn  from  the  data  presented  in  the 
Results  section: 

1.  The  error  of  latency  estimation  decreases  exponentially  as  the 
signal-to-nolse  ratio  Increases. 

2.  Cross-correlation  provides  a  more  accurate  estimate  than 
peak-picking. 

3.  Woody  filter  with  2  or  3  Iterations  is  comparable  to 


cross-correlation  (although  this  result  may  be  due  to  the 
fact  that  the  template  for  cross-correlation  was  the  same  as 
the  target  component). 
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4.  Frequency  filters  improve  markedly  the  accuracy  of  estimates 
obtained  with  peak-picking  and,  to  a  lesser  extent,  with 
cross-correlation. 

5.  Vector  filter  yields  estimates  that  are  more  accurate  than  single 
electrode  selection  (Pz). 

6.  Overlapping  components  impair  the  accuracy  of  estimates  of  single 
electrode  selection,  while  the  latency  estimates  of  Vector 
filtered  data  are  impaired  only  if  the  scalp  distribution  is 
similar  to  that  of  P300  (e.g.  Slow  Wave). 

7.  The  accuracy  improvements  obtained  with  cross-correlation,  Vector 
filter,  and  increases  in  signal-to-nolse  ratio  appear  to  be 
independent.  The  Improvements  in  accuracy  obtained  with 

Vector  filter  and  frequency  filters  are  also  independent. 

8.  The  accuracy  improvements  obtained  with  cross-correlation  and 
frequency  filtering  are  not  additive  --  that  is,  the  combined 
use  of  both  these  procedures  does  not  produce  more  accurate 
estimates  than  either  of  them  alone. 

9.  Latency-adjustment  procedures  may  "create"  artif actual 
components.  This  is  especially  apparent  at  small  signal-to-noise 
ratios.  However,  this  phenomenon  is  less  evident  when  Vector 
filtered,  rather  than  Pz,  data  are  considered. 

Some  of  the  findings  described  above  may  be  related  to  the  particular 
conditions  used  in  this  study.  The  procedures  adopted  were  largely 
arbitrary  (although  we  did  attempt  to  simulate  veridical  conditions),  and 
variations  of  some  of  the  parameters  may  have  a  crucial  Impact  on  the 
accuracy  of  latency  estimates.  In  order  to  generalize  these  findings  to  a 
wider  variety  of  situations,  five  additional  analyses  were  performed.  These 
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analyses  explored  the  effects  of  five  variables  on  the  accuracy  of  latency 
estimates:  (a)  standardizing  background  EEG  noise,  (b)  varying  the  duration 
of  P300,  (c)  varying  the  latency  of  P300,  (d)  varying  the  parameters  of  the 
frequency  filter,  and  (e)  varying  the  parameters  used  for  Vector  filter. 

Simulation  2:  Effect  of  standardizing  background  EEG  noise. 

Our  simulation  of  background  EEG  noise  included  the  standardization, 
across  trials  and  separately  for  each  time-point,  of  single-trial 
deviations.  The  purpose  was  to  obtain  comparable  variance  across  the  whole 
epoch.  However,  this  might  affect  the  veridicality  of  our  simulation 
procedure. 

Method 

To  investigate  further  the  effects  of  the  standardization  procedure,  we 
ran  part  of  the  basic  design  (Simulation  1)  on  non-standardized  waveforms. 
The  replication  was  exact,  apart  from  the  absence  of  frequency  filtering. 
Note  also  that  the  signal-to-noise  ratio  for  non-standardized  waveforms 
could  not  be  exactly  determined.  However,  P300  amplitude  was  manipulated  as 
in  the  basic  design,  and  the  level  of  noise  in  the  P300  region  was  roughly 
comparable  to  that  of  the  basic  design.  Thus,  the  same  scale  was  adopted 
for  the  signal-to-noise  ratio  manipulation. 

Results 

Some  of  the  results  obtained  with  non-standardized  noise  are  presented 
in  Figure  6. 
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Insert  Figure  6  About  Here 


Discussion 

Comparison  of  the  accuracy  of  latency  estimation  with  standardized 
(Figure  5,  upper  left  panel)  and  non-standardized  background  noise  (Figure 
6)  indicates  that  the  standardization  procedure  did  not  significantly  affect 
the  results.  All  the  findings  were  replicated.  Thus,  the  standardization 
procedure  did  not  impair  the  veridical ity  of  the  simulation. 

Simulation  3:  Effect  of  P300  duration. 

Since  the  template  we  used  for  the  cross-correlation  technique  was  the 
same  as  the  target  component  Itself,  the  detection  of  P300  obtained  with 
this  algorithm  may  be  more  accurate  than  that  which  could  be  obtained  with 
observed  data,  when  the  "true"  P300  waveshape  is  not  perfectly  known. 
Therefore,  Simulation  1  does  not  really  address  the  question  of  the 
advantage  of  iterating  with  Woody  filter.  In  Simulation  3,  we  considered 
cases  in  which  we  do  not  have  a  good  representation  of  the  P300  waveshape. 
Method 

To  this  end,  we  varied  systematically  the  duration  of  the  simulated 
P300  between  200  and  800  msec,  with  increments  of  100  msec.  However,  we  did 
not  vary  the  wavelength  of  the  template  used  for  the  cross-correlation 
procedure.  P300  amplitude  was  fixed  at  250  units  (corresponding  to  a 
signal-to-noise  ratio  of  2.5).  Background  EEG  noise  was  simulated  through 
standardized  waveforms,  but  no  overlapping  components  were  added.  Ho 
frequency  filtering  was  applied. 
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Results 

Log  MSE  (and  number  of  trials  required  to  obtain  a  standard  error  of  3 
msec)  as  a  function  of  P300  duration  for  two  spatial  filtering  procedures 
(Pz  selection  and  Vector  filter)  and  four  signal  detection  algorithms  (peak¬ 
picking,  cross-correlation,  WocJy  filter  with  two  iterations,  and  Woody 
filter  with  three  iterations)  are  shown  in  Figure  7. 


Insert  Figure  7  About  Here 


Inspection  of  this  figure  reveals  several  noteworthy  findings.  First, 
the  accuracy  of  latency  estimation  depends  on  the  duration  of  P300.  The 
sharper  the  P300,  the  more  accurate  the  estimate.  This  Is  particularly  true 
for  the  peak-picking  procedure  (especially  If  used  In  conjunction  with 
Vector  filter)  and  Woody  filter.  For  cross-correlation,  the  most  accurate 
estimation  is  obtained  when  the  duration  of  the  P300  Is  slightly  shorter 
(400  msec)  than  that  of  the  template.  When  the  duration  of  the  component  Is 
shorter  than  that  of  the  template,  peak-picking  and  Woody  filter  produce 
estimates  equal  to  or  more  accurate  than  cross-correlation.  However,  when 
the  duration  of  the  component  is  longer  than  the  duration  of  the  template, 
cross-correlation  yields  better  estimates. 

Discussion 

These  results  suggest  that  peak-picking  and  Woody  filter  produce 
accurate  estimates  In  cases  of  sharp  components.  For  peak-picking,  this  Is 
not  surprising.  For  Woody  filter,  it  may  be  that  this  procedure  produces 
sharper  templates  at  each  iteration.  Thus,  iterating  with  Woody  filter  may 
be  advantageous  when  the  original  template  has  a  longer  wavelength  than  the 
target  component,  but  disadvantageous  when  the  original  template  has  a 
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shorter  wavelength  than  the  target  component.  It  is  interesting  to  note 
that,  in  cases  of  latency  jitter,  the  averages  tend  to  be  "smooth"  and  the 
components  "widened."  Therefore,  using  the  "unadjusted"  average  waveform  as 
the  template  for  the  first  iteration  of  Woody  filter  may  be  appropriate  in 
cases  in  which  the  duration  of  the  target  component  is  not  known. 

Simulation  4:  Effect  of  variability  of  P300  latency. 

The  second  and  third  iterations  of  the  Woody  filter  procedure  use  the 
average  waveform  as  the  template.  If  the  latency  of  the  component  varies 
from  trial  to  trial,  then  the  average  will  not  provide  an  accurate 
representation  of  the  ERP.  The  rationale  for  Iteration  Is  that  a  more 
accurate  template  will  be  obtained  by  aligning  single  trials  on  the  basis  of 
single  trial  latency  estimates,  and  then  deriving  a  new  average.  Of  course, 
iterations  will  only  be  useful  when  the  waveshape  of  the  target  component  is 
not  known.  In  Simulation  4,  we  investigated  the  effects  of  latency  jitter 
and  of  lack  of  knowledge  about  the  waveshape  of  the  target  component. 

Method 

The  latency  of  P300  was  varied  by  sampling  from  a  distribution  with  a 
mean  of  550  msec  and  a  standard  deviation  of  83  msec,  and  the  duration  of 
P300  was  varied,  with  three  levels,  300,  500,  and  700  msec.  The  duration  of 
the  template  for  cross-correlation  (and  first  iteration  of  Woody  filter)  was 
fixed  at  500  msec.  The  other  parameters  of  this  simulation  are  reported  In 
Table  3. 

Results 


The  results  of  this  analysis  are  presented  in  Table  3. 
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Insert  Table  3  About  Here 


Discussion 

These  results  indicate  that,  in  conditions  of  latency  jitter,  the 
advantage  of  Vector  filter  over  single  electrode  selection  remains,  while 
the  advantage  of  cross-correlation  over  peak-picking  depends  on  the 
correspondence  between  the  template  and  the  target  component.  The  most 
interesting  comparison  is  between  cross-correlation  and  Woody  filter  —  that 
is,  the  effect  of  iteration.  Woody  filter  with  two  iterations  yields 
latency  estimates  that  are  more  accurate  than  cross-correlation  when  the 
component  is  sharper  than  the  template,  while  no  advantage  for  the  Iteration 
procedure  is  evident  when  the  template  is  equal  to  or  sharper  than  the 
component. 

Simulation  5:  Effect  of  variation  in  frequency  filter  parameters. 

The  data  presented  In  Simulation  1  Indicates  that  the  use  of  a  low-pass 
frequency  filter  leads  to  an  Improvement  In  the  estimation  of  P300  latency. 
The  lower  the  cut-off  point,  the  greater  was  the  Improvement.  Of  course,  at 
some  point,  extremely  low  cut-offs  will  completely  attenuate  the  P300  and 
therefore  lead  to  poor  detection.  To  determine  which  cut-off  level  leads  to 
the  greatest  accuracy  In  the  estimation  of  P300  latency,  we  ran  another 
simulation  with  a  wider  range  of  cut-off  settings. 

Method 

Five  cut-offs  points  (-3  dB)  were  used:  6.29  Hz,  3.38  Hz,  2.32  Hz,  1.76 
Hz,  and  1.42  Hz.  Another  condition  was  Included  in  which  no  off-line  filter 
was  used.  Each  of  these  frequency  filter  conditions  was  applied  to  150 
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trials  generated  by  adding  different  samples  of  EEG  noise  to  target  and 
overlapping  components.  The  parameters  of  these  components  were  the  same  as 
in  Simulation  4  and  are  shown  in  Table  4. 

Results 

MSE  data  for  the  different  frequency  filters,  for  Vector  filter  and 
single  electrode  selection,  and  for  peak-picking  and  cross-correlation,  are 
reported  in  Table  4. 


Insert  Table  4  about  here 


Discussion 

The  results  confirm  that  frequency  filters  Improve  the  accuracy  of  the 
P300  latency  estimates.  This  is  particularly  true  for  latencies  estimated 
with  peak-picking,  while  the  gain  for  estimates  obtained  with 
cross-correlation  was  quite  small.  The  most  accurate  estimates  were 
obtained  at  a  cut-off  point  of  1.76  Hz.  Lower  and  higher  cut-offs  were 
associated  with  inferior  estimates,  although  those  obtained  for  adjacent 
cut-offs  were  almost  as  good  as  those  for  1.76  Hz.  This  suggests  that  the 
setting  of  the  low-pass  filter  does  not  need  to  be  very  precise. 

Simulation  6:  Effect  of  variation  of  Vector  filter  weights. 

The  weights  for  Vector  filter  used  in  the  basic  design  were  chosen  to 
discriminate  between  the  target  component  (P300)  and  various  sources  of 
noise.  The  scalp  distribution  we  filtered  was  not  that  of  the  target 
component  (In  our  case,  P300).  Recall  that  the  weights  of  this  filter  were 
chosen  on  the  basis  of  their  ability  to  discriminate  between  sets  of 
waveforms  associated  with  rare  and  frequent  stimuli.  However,  other  weights 


Simulation  studies  of  latency  measures 


30 


could  have  been  chosen,  and  some  of  them  might  have  produced  superior 
results  than  the  set  selected.  To  evaluate  the  consequences  of  the  choice 
of  Vector  filter  weights,  we  varied  them  systematically  and  studied  their 
impact  on  the  accuracy  of  latency  estimation. 

Method 

For  this  study,  we  used  a  P300  amplitude  of  250  units  (signal-to-nolse 
ratio  =  2.5).  Background  EEG  noise  was  simulated  with  standardized 
waveforms,  and  no  frequency  filtering  was  used.  The  overlapping  components 
included  N200  (amplitude  =  50  units;  latency  =  300  ms)  and  Slow  Wave 
(amplitude  =  100  units;  latency  =  800  ms).  Following  the  nomenclature  used 
previously,  this  condition  may  be  described  as  "moderate  Slow  Wave  overlap." 
P300  latency  was  estimated  using  cross-correlation  (the  length  of  the 
template  was  equal  to  that  of  P300).  This  procedure  was  selected  because  It 
yielded  the  best  results  in  the  basic  simulation. 

The  47  sets  of  weights  of  Vector  filter  used  in  this  simulation  are 
presented  in  Table  5. 


Insert  Table  5  about  here 


Note  that  45  of  these  sets  can  be  organized  In  9  groups  of  five,  such  that 
each  group  represents  an  "individual"  profile,  whose  mean  is  equal  to  0.00, 
0.15,  0.29,  0.41,  and  0.50.  This  variation  In  mean  was  chosen  to  encompass 
possible  variations  in  the  sensitivity  of  the  filter  to  variation  in 
polarity.  The  nine  profiles  (labelled  with  the  letters  from  A  to  I)  were 
selected  so  as  to  cover  a  variety  of  filters,  all  satisfying  the  criterion 
of  having  the  parietal  electrode  weighted  equally  or  more  positively  than 
the  other  electrodes.  Note  that  the  filter  having  the  profile  labelled  C 
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and  the  mean  of  the  weights  equal  to  0.15  is  that  used  for  all  the  previous 
simulations.  The  last  two  sets  of  weights  characterize,  respectively,  a 
filter  that  is  equivalent  to  the  selection  of  the  Pz  electrode,  and  a  filter 
equivalent  to  computing  the  average  across  electrodes.  The  selection  of  the 
Pz  electrode  can  also  be  considered  equivalent  to  a  filter  having  a  profile 
E,  and  a  mean  of  the  weights  equal  to  0.33.  The  average  of  the  electrodes 
correspond  to  a  filter  with  all  the  weights  equal  to  0.58.  Note  that,  for 
each  set,  the  sum  of  squares  of  the  weights  is  equal  to  1.00.  Thus,  as  the 
profiles  change  and  as  their  mean  value  changes,  the  units  of  the  output  of 
the  Vector  filters  do  not  change.  Note,  finally,  that  the  "input"  scalp 
distribution  of  P300  (In  this  study)  was  equivalent  to  a  set  of  weights  with 
profile  G,  and  mean  equal  to  0.54.  This  set  of  weights  is  also  shown  in 
Table  5.  Some  of  the  profiles  are  shown  graphically  in  Figure  8. 


Insert  Figure  8  About  Here 


Results 

The  effect  of  varying  the  weights  of  Vector  filter  was  evaluated  by 
comparing  the  accuracy  obtained  with  each  of  the  47  sets  of  weights  with  the 
accuracy  obtained  with  single  electrode  selection  (Pz).  Thus,  the  MSE 
associated  with  the  single  electrode  selection  (Pz)  was  divided  by  that 
observed  for  each  set  of  weights  to  yield  a  relative  performance  measure  for 
each  filter.  Note  that  a  value  above  1  Indicates  that  the  MSE  obtained  with 
a  given  set  of  weights  is  smaller  than  that  obtained  with  single  electrode 
selection  (and,  therefore,  that  the  latency  estimate  is  more  accurate), 
while  a  value  below  1  Indicates  a  higher  MSE.  In  general,  the  higher  the 
relative  performance  measure,  the  better  the  filter.  These  relative 
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performance  values  are  shown  In  Table  6. 


Insert  Table  6  about  here 


Discussion 

Table  6  shows  that  the  optimal  set  of  Vector  filter  weights  corresponds 
to  the  profile  C  with  a  mean  value  of  0.15.  This  is  in  fact  the  set  of 
weights  for  Vector  filter  we  used  in  the  basic  design.  With  this  setting 
the  gain  in  the  accuracy  of  latency  estimates  relative  to  single  channel 
selection  is  more  than  20%.  Thus,  the  parameters  we  chose  were  optimal. 

Note  that  this  set  of  weights  was  originally  selected  on  the  basis  of 
observed  data.  The  fact  that  these  weights  yield  the  best  performance  In 
our  simulation  supports  the  procedure  (discriminant  analysis  between  a  set 
of  rare  and  frequent  trials)  we  used  to  derive  this  set  of  weights. 

Although  the  set  of  weights  used  In  the  basic  design  yields  the  most 
accurate  latency  estimates,  other  sets  of  weights  also  produce  improvement 
in  comparison  to  single  electrode  (Pz)  selection.  In  general,  these  sets 
share  the  following  characteristics:  (a)  the  weights  are  ordered  with 
maximum  values  at  Pz,  followed  by  Fz,  and  by  Cz;  (b)  the  mean  of  the  weights 
is  positive;  and  (c)  the  weight  for  Cz  is  negative. 

Note  that  several  sets  of  weights  yield  accuracy  that  is  worse  than 
single  channel  selection.  This  is  not  surprising.  In  fact,  these  sets  of 
weights  correspond  to  scalp  distributions  which  do  not  enhance  the 
discrimination  between  signal  and  noise.  Rather,  they  enhance  the  noise  or 
reduce  the  signal. 


The  results  of  this  simulation  indicate  that  filtering  for  a  scalp 
distribution  that  is  different  from  that  of  the  target  component  can  produce 
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even  better  results  than  filtering  for  the  target  component  itself 
(equivalent  to  profile  G  with  mean  of  0.54).  This  is  likely  to  occur  when 
the  scalp  distribution  of  the  target  component  and  that  of  the  noise 
overlap.  It  is  important  to  note  that  although  the  average  background  noise 
value  is  equal  to  0,  it  can  be  characterized  on  single  trials  by  large 
positive  and  negative  peaks.  To  the  extent  that  the  scalp  distribution  of 
the  positive  peaks  is  similar  to  that  of  the  target  component,  these  peaks 
may  be  misidentif ied  as  the  target  component.  For  instance,  a  positive 
noise  peak  may  be  characterized  by  large  voltage  values  at  Pz  and  Cz,  and 
thus  resemble  the  scalp  distribution  of  P300.  In  our  case,  high  positive 
values  at  Pz  tend  to  co-occur  with  high  positive  values  at  Cz  (the 
correlation  between  noise  at  these  two  electrodes  is  above  0.80).  For  this 
reason,  a  larger  reduction  of  noise  and  consequently  an  improvement  of  the 
signal-to-noise  ratio,  is  obtained  by  filtering  for  a  scalp  distribution 
that  maximizes  the  discrimination  between  the  P300  and  the  positive  noise 
peaks.  This  is  achieved  when  different  polarity  values  are  assigned  to  the 
weights  for  Pz  and  Cz.  In  general,  it  is  advisable  to  use  a  set  of  weights 
which  dissociates  the  activity  of  different  electrodes  with  strong  noise 
coherence.  A  detailed  discussion  of  the  selection  of  weights  Is  given  In 
Gratton  et  al.  (submitted). 


General  Discussion 

The  results  presented  in  this  paper  indicate-  that  the  accuracy  of 
latency  estimation  is  affected  by  several  variables,  including  the 
signal-to-noise  ratio,  characteristics  of  signal  and  noise,  the  use  of 
preparatory  (filtering)  procedures,  and  the  choice  of  the  signal  detection 
algorithm. 
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The  signal-to-noise  ratio  appears  to  be  the  most  important  factor.  In 
general,  the  error  of  estimation  decreases  exponentially  with  Increases  in 
the  signal-to-noise  ratio.  Thus,  any  methodology  which  enhances  the  signal- 
to-noise  ratio  is  very  valuable.  However,  the  effect  of  the  signal-to-noise 
ratio  does  not  appear  to  interact  with  other  effects.  Procedures  which 
yield  the  most  accurate  estimates  at  high  levels  of  signal-to-noise  ratio 
tend  to  produce  the  most  accurate  estimates  at  low  levels.  Thus,  while 
knowledge  of  the  signal-to-noise  ratio  may  be  critical  for  estimating  the 
power  of  the  procedure,  it  is  Irrelevant  for  the  choice  of  the  algorithm  for 
latency  estimation. 

The  use  of  spatial  information  for  enhancing  the  detection  of  ERP 
components  can  be  very  useful,  particularly  when  information  about  the  scalp 
distribution  of  the  target  component  is  available.  In  general.  Vector 
filter  produced  more  accurate  latency  estimates  than  single  electrode 
selection  at  Pz,  or  than  the  average  across  electrodes.  This  effect  was 
most  evident  when  overlapping  components  were  present  and  when  these 
components  had  a  scalp  distribution  which  was  different  from  the  P300 
(N200).  However,  the  choice  of  the  weights  of  Vector  filter  is  also 
important.  These  weights  should  be  such  that  the  discrimination  between 
signal  and  noise  Is  enhanced,  rather  than  merely  mirroring  the  spatial 
distribution  of  the  signal.  In  our  case,  we  used  a  discriminant  function 
analysis  between  two  sets  of  observed  waveforms  from  rare  and  frequent 
trials  to  select  the  set  of  weights  for  P300.  TMs  procedure  yields  weights 
which  dissociate  the  parietal  and  the  central  electrode,  while  still 
emphasizing  the  positive  trend  across  electrodes.  These  weights  fared  quite 
well  when  compared  to  the  selection  of  the  Pz  electrode,  or  to  other  sets  of 
weights  that  emphasize  the  positive,  parietal  distribution  of  P300. 
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Frequency  filtering  produces  improvement  in  the  accuracy  of  latency 
estimation.  The  advantage  is  particularly  evident  when  peak-picking  is  used 
as  the  signal  detection  procedure.  The  accuracy  obtained  with 
cross-correlation  does  not  seem  to  be  much  improved  by  the  use  of  low-pass 
frequency  filters,  possibly  because  this  procedure,  in  contrast  with 
peak-picking,  is  already  based  on  several  time-points.  Our  Investigation 
showed  that  the  optimal  frequency  filter  for  P300  has  a  cut-off  at  1.76  Hz. 
Further  filtering  produces  impairment  of  signal  detection  because  the  signal 
itself  is  degraded.  Of  course,  the  signal  Is  also  degraded  with  a  cut-off 
of  1.76  Hz,  but  the  noise  is  presumably  degraded  to  a  greater  extent.  It 
should  be  noted  that  our  findings  relate  only  to  filters  based  on  moving 
averages,  and  that  we  did  not  compare  the  effect  of  filters  with  other  band¬ 
pass  characteristics,  or  the  effect  of  high-pass  filters. 

The  choice  of  the  signal  detection  algorithm  may  also  affect  the 
accuracy  of  latency  estimates.  Cross-correlation  produces  better  results 
than  peak-picking,  at  least  when  the  wavelength  of  the  template  is 
comparable  to,  or  shorter  than,  the  wavelength  (duration)  of  the  signal. 

The  difference  between  the  two  procedures  may  also  be  reduced  by  the  use  of 
appropriate  frequency  filters.  Two  or  three  Iterations  of  Woody  filter  do 
not  yield  significant  improvement  over  cross-correlation  alone  in  cases  In 
which  the  template  for  cross-correlation  has  a  wavelength  comparable  to,  or 
shorter  than,  that  of  the  signal.  However,  the  Woody  filter  iterations 
produced  a  marked  improvement  in  accuracy  in  those  cases  in  which  the 
wavelength  of  the  template  was  much  longer  (2  times  or  more)  than  that  of 
the  signal.  Thus,  cross-correlation  alone  appears  the  best  choice  when  the 
duration  of  the  target  component  is  known  (at  least  approximately).  When  no 
Information  is  available,  cross-correlation  with  iterative  Woody  filter 
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should  be  used.  The  use  of  peak-picking  should  be  restricted  to  the 
detection  of  sharp  (duration  equal  or  less  than  300  msec)  components,  and, 
even  in  these  cases,  its  use  may  be  justified  mainly  on  the  basis  of  Its 
simplicity  and  low  computational  load. 

As  a  general  commentary,  the  results  of  these  studies  emphasize  that 
the  characteristics  of  both  signal  and  noise  must  be  considered  for  the 
choice  of  procedures  for  the  estimation  of  the  latency  of  ERP  components. 

The  interaction  between  signal  and  noise  characteristics  was  particularly 
evident  for  the  choice  of  spatial  filtering  procedures.  However,  we  believe 
that  this  is  merely  an  instance  of  a  general  principle,  and  that  ERP  signal 
detection  algorithms  should  be  based  on  those  characteristics  of  the  signal 
which  allow  its  discrimination  from  the  noise  in  which  it  is  embedded. 
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Footnotes 

1.  Other  sources  of  noise,  such  as  those  related  to  eye-movements, 
electromyographic  activity,  and  electrocardiographic  activity,  will  not  be 
considered  here. 

2.  Gratton  et  al.  (submitted)  described  a  procedure  that  allows  the 
investigator  to  decompose  the  ERP  observed  at  several  electrode  locations 
into  the  contribution  of  components  charaterized  by  different  scalp 
distributions.  The  procedure  is  comprised  of  three  critical  elements,  (a) 
an  obtained  ERP  waveform  for  each  of  several  electrodes,  for  which  we  want 
to  know  the  time  course  of  the  constituent  components;  (b)  a  set  of 
characteristic  weights  (one  for  each  electrode)  for  each  component  --  these 
weights  describe  the  characteristic  scalp  distribution  of  the  component;  and 
(c)  an  output  waveform  (amplitude  x  time  function)  for  each  component  that 
describes  its  timecourse  in  the  obtained  waveforms.  This  output  waveform 
can  be  interpreted  as  a  result  of  the  application  of  a  specific  spatial 
filter  that  emphasizes  the  contribution  of  a  component  with  a  particular 
scalp  distribution.  The  amplitudes  of  each  component  at  each  time  point  are 
derived  by  using  a  least  squares  criterion.  In  most  of  the  simulation 
studies  presented  in  this  paper,  only  one  set  of  weights  is  used.  However, 
in  the  last  simulation,  the  use  of  other  sets  of  weights  will  be  examined. 

3.  The  "Woody  filter"  has  been  applied  to  ERPs  (e.g.,  Kutas  et  al., 
1977)  to  overcome  the  problem  of  latency  jitter.  The  procedure  was  used  to 
obtain  estimates  of  the  waveshape  of  the  target  component  that  were 
progressively  more  accurate  at  each  iteration. 

Wastell  (1977)  Investigated  the  utility  of  the  Iterating  procedure 
proposed  by  Woody  (1967).  He  reported  that,  if  an  appropriate  template  has 
been  selected,  iterations  do  not  improve  the  detection  of  the  signal. 
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Furthermore,  Pfefferbaum  (1983)  found  that  iterations  of  the  Woody  procedure 
may  extract  spurious,  artifactual  components  that  are  indistinguishable  from 
real  components  when  the  signal-to-noise  ratio  is  very  low.  He  concluded 
that  cross-correlaticnal  techniques  (and  Woody  filter  in  particular)  produce 
the  best  results  at  relatively  high  signal-to-noise  ratios. 

4.  In  this  experiment,  the  subject  was  presented  with  one  of  two  tones 
on  any  given  trial.  The  tone  probabilities  were  0.2  and  0.8.  ERPs  were 
recorded  at  Fz,  Cz,  and  Pz.  The  on-line  filtering  procedure  Included  a  low- 
pass  filter  with  a  half  amplitude  cut-off  point  at  35  Hz,  and  a  high-pass 
filter  with  a  time  constant  of  8  sec.  Vertical  EOG  was  recorded  from  above 
and  below  the  right  eye,  and  ocular  artifacts  were  corrected  with  a 
procedure  described  in  Gratton,  Coles,  and  Donchin  (1983).  Separate 
averages  were  obtained  for  frequent  and  rare  trials  and  for  each  electrode. 

5.  This  measure  was  used  to  assess  the  relative  power  of  the  different 
procedures  and  was  obtained  as  follows.  The  MSE  can  be  considered  an 
estimate  of  the  standard  deviation  of  the  population  of  single-trial  P300 
latency  estimates  for  each  condition.  (Note  that,  in  this  case,  the 
population  mean  is  known.)  However,  the  mean  of  the  single-trial  estimates 
of  the  sample  may  not  correspond  to  the  mean  of  the  population  (550  msec). 

If  the  normality  assumption  is  met,  we  can  compute  the  theoretical 
distribution  of  the  population  of  sample  means  from  which  the  mean  of  our 
sample  is  extracted.  Following  the  theorem  of  central  tendency,  this 
distribution  will  have  a  width  (measured  by  the  standard  error  of  estimate) 
that  is  proportional  to  the  MSE  (standard  deviation)  and  Inversely  related 
to  square  root  of  the  number  of  trials  used  to  compute  the  mean.  By 
increasing  the  number  of  trials  we  may  theoretically  reduce  the  standard 
error  of  the  estimate  to  any  desired  value.  Thus,  by  appropriately  setting 
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the  sample  size,  we  may  in  theory  obtain  a  standard  error  of  estimate  of  3 
msec,  given  a  specific  value  of  the  MSE.  In  fact,  we  can  compute  the  sample 
size  required  with  the  following  equation: 

2 

N  =  (MSE  /  3)  +  1 

where  N  is  the  number  of  trials  required  to  obtain  a  standard  error  of  3 
msec.  Note  that  this  value  is  only  an  approximation.  In  fact,  it  requires 
(a)  that  the  distribution  of  the  single-trial  estimates  is  normal,  and  (b) 
that  the  sample  mean  is  not  systematically  different  from  550  msec.  The 
first  assumption  is  violated,  since  only  values  inside  the  time  window  (300 
to  800  msec)  are  possible.  However,  the  distribution  of  the  single-trial 
estimates  is  approximately  normal  when  the  signal-to-noise  ratio  is  larger 
than  1.  Examples  of  distributions  of  single-trial  estimates  for  different 
signal-to-noise  ratios  will  be  shown  later.  The  second  assumption  may  also 
be  violated,  but  it  holds  in  most  cases.  Since  the  number  of  trials 
required  to  obtain  a  standard  error  of  estimate  of  3  msec  are  related  to  the 
MSE,  we  simply  added  a  scale  reporting  the  corresponding  values  for  this 
dependent  variable  in  most  of  the  figures  in  which  MSE  (or  log  MSE)  is 
presented. 
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Table  1.  Basic  Design 


(A)  Trials  per  condition  (100) 


(B)  Signal  detection  algorithms  (4) 

peak-picking 

cross-correlation 

Woody  filter  with  two  Iterations 

Woody  filter  with  three  Iterations 


(C)  Scalp  distribution  filtering  techniques  (2) 

single  electrode  selection  (Pz) 
Vector  filter 


(D)  Target  component  amplitude  levels  (11) 

signal-to-noise  ratio  varying  from  0  to  5  by  .5  Increments 


(E)  Amplitude  levels  of  overlapping  components  (5) 
no  overlap 

N200:  50,  100 

Slow  Wave:  100,  200 


(F)  Latency  levels  of  overlapping  components  (4) 

N200:  300,  400 

Slow  Wave:  800,1080 


(G)  Frequency  filter  conditions  (5) 
no  filter 

6.29  Hz  (7  points,  i  iteration) 
4.43  Hz  (7  points,  2  iterations) 

3.38  Hz  (13  points,  1  iteration) 

2.38  Hz  (13  points,  2  iterations) 


(H)  Dependent  Variables  (2) 

root  mean  square  error  (MSE) 

number  of  trials  required  to  obtain  3  msec  error 
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Table  2 

Parameters  of  ERP  Components 


Amplitude 

Latency 

Duration 

Scalp  Distribution 

(units) 

(msec) 

(msec) 

Weights 

Components 

Fz  Cz  Pz 

N100 

100 

100 

100 

-0.6 

-1.8 

-0.6 

P200 

150 

250 

150 

1.0 

1.0 

0.5 

N200 

50/100 

300/400 

250 

-1.2 

-0.8 

-0.4 

Slow  Wave 

100/200 

800/1080 

800 

-0.4 

0.0 

0.4 

P300 

0  to  500 

550 

500 

0.4 

0.8 

1.2 
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Table  3 

MSE  in  the  case  of  Latency  Jitter 


Measures 

300  msec 

P30G  duration 

500  msec 

700  msec 

Peak-picking 

Pz  selection 

40 

55 

75 

Vector  filter 

24 

49 

68 

Cross-correlation 

Pz  selection 

61 

47 

75 

Vector  filter 

26 

35 

67 

Woody  filter  (2  iterations) 

Pz  selection 

56 

41 

74 

Vector  filter 

20 

37 

66 

Woody  filter  (3  iterations) 

Pz  selection 

49 

42 

74 

Vector  filter 

20 

43 

67 

Component  parameters: 

P300  amplitude  =  250  units 

(signal-to-nolse  ratio  =  2.5) 

N100  amplitude  =  100  units 

N100  latency  =  100  msec 

P2Q0  amplitude  =  150  units 

P200  latency  =  250  msec 

N20Q  amplitude  =  50  units 

N200  latency  *  300  msec 

Slow  Wave  amplitude  =  100  units 

Slow  Wave  latency  *  800  msec 
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Table  4 

MSE  for  different  low-pass  cut-offs 


-3  dB 

Cut-offs 

Pz 

Peak-Picking 

Vector  filter 

Cross-Correlation 

Pz  Vector  filter 

No  filter 

50 

47 

31 

34 

6.29  Hz 

47 

44 

31 

30 

3.38  Hz 

40 

38 

42 

32 

2.32  Hz 

46 

39 

49 

37 

1.76  Hz 

38 

38 

32 

26 

1.42  Hz 

44 

42 

43 

36 

Component  parameters: 

P300  amplitude  = 

250  units 

(signal-to-noise  ratio  =  2.5) 

N100  amplitude  = 

100  units 

N100  latency 

=  100  msec 

P200  amplitude  x 

150  units 

P200  latency 

=  250  msec 

N200  amplitude  » 

50  units 

N2Q0  latency 

=  300  msec 

Slow  Wave  amplitude  =  100  units 


Slow  Wave  latency  =  800  msec 
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Table  5.  Sets  of  Vector  filter  weights  used  for  Simulation  6. 


Mean  of  the  Profiles 


wei 

ights 

A 

B 

C 

0 

E 

F 

G 

H 

I 

Mean 

=  0.00 

Fz 

0.41 

0.21 

0.00 

-0.21 

-0.41 

-0.58 

-0.71 

-0.79 

-0.82 

Cz 

-0.82 

-0.79 

-0.71 

-0.58 

-0.41 

-0.21 

0.00 

0.21 

0.41 

Pz 

0.41 

0.58 

0.71 

0.79 

0.82 

0.79 

0.71 

0.58 

0.41 

Mean 

=  0.15 

Fz 

0.54 

0.35 

0.15 

-0.05 

-0.24 

-0.41 

-0.53 

-0.61 

-0.64 

Cz 

-0.64 

-0.61 

-0.53 

-0.41 

-0.24 

-0.05 

0.15 

0.35 

0.54 

Pz 

0.54 

0.71 

0.83 

0.91 

0.94 

0.91 

0.83 

0.71 

0.54 

Mean 

=  0.29 

Fz 

0.64 

0.47 

0.29 

0.11 

-0.06 

-0.21 

-0.32 

-0.39 

-0.42 

Cz 

-0.42 

-0.39 

-0.32 

-0.21 

-0.06 

0.11 

0.29 

0.47 

0.64 

Pz 

0.64 

0.79 

0.90 

0.97 

1.00 

0.97 

0.90 

0.79 

0.64 

Mean 

=  0.41 

Fz 

0.70 

0.56 

0.41 

0.26 

0.12 

0.00 

-0.09 

-0.15 

-0.17 

Cz 

-0.17 

-0.15 

-0.09 

0.00 

0.12 

0.26 

0.41 

0.56 

0.70 

Pz 

0.70 

0.82 

0.91 

0.97 

0.99 

0.97 

0.91 

0.82 

0.70 

Mean 

=  0.50 

Fz 

0.70 

0.61 

0.50 

0.39 

0.30 

0.21 

0.15 

0.11 

0.09 

Cz 

0.09 

0.11 

0.15 

0.21 

0.30 

0.39 

0.50 

0.61 

0.70 

Pz 

0.70 

0.79 

0.85 

0.89 

0.91 

0.89 

0.85 

0.79 

0.70 

Selection  of  Pz  electrode*:  Fz  0.00 

Cz  0.00 
Pz  1.00 


Average  across  electrodes**:  Fz  0.58 

Cz  0.58 
Pz  0.58 

Scalp  distribution  of  input  P300***:  Fz  0.27 

Cz  0.54 
Pz  0.80 


*  This  is  equivalent  to  profile  E  with  a  mean  of  0.33. 


**  The  weights  within  each  set  were  chosen  with  the  constraint  that  the 
sum  of  their  squared  values  had  to  be  equal  to  1. 

***  This  is  equivalent  to  profile  G  with  a  mean  of  0.54. 
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Table  6.  Ratio  between  MSE  for  single  channel  selection  (Pz)  and  the 
MSE  for  each  of  the  Vector  filters. 


Mean  Profiles 


A 

8 

C 

0 

E 

F 

G 

H 

I 

0.00 

0.18 

0.34 

0.38 

0.61 

0.43 

0.51 

0.38 

0.45 

0.48 

0.15 

0.82 

1.13 

1.22 

1.13 

1.08 

0.90 

0.82 

0.79 

0.76 

0.29 

0.93 

1.03 

1.11 

1.07 

1.00 

1.00 

0.89 

0.91 

0.84 

0.41 

0.58 

1.04 

1.00 

1.04 

1.00 

0.92 

0.96 

0.93 

0.96 

0.50 

0.70 

0.93 

1.00 

1.05 

0.97 

1.00 

0.90 

0.93 

0.90 

Selection  of  Pz  electrode*:  1.00 

Average  across  electrodes:  0.67 


*  This  is  equivalent  to  profile  E  with  a  mean  value  of  0.33 
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Figure  Legends 

Figure  1.  Examples  of  simulated  single-trial  waveforms.  Waveforms 
without  EEG  noise  are  shown  in  the  upper  panel,  waveforms  with  EEG  noise  are 
displayed  in  the  lower  panel.  The  amplitude  is  expressed  in  arbitrary  units 
(simulated  P300  amplitude  =  250  units). 

Figure  2.  Log  MSE  and  number  of  trials  required  to  obtain  a  standard 
error  of  3  msec,  as  a  function  of  signal-to-nolse  ratio,  for  different 
frequency  filters,  signal  detection  algorithms,  and  spatial  filtering 
conditions. 

Figure  3.  Histograms  of  latency  estimates  for  four  different 
signal -to-noise  ratios  for  two  detection  algorithms  and  two  spatial 
filtering  techniques.  The  vertical  lines  indicate  the  latency  of  the 
simulated  P300. 

Figure  4.  Latency-adjusted  average  waveforms  over  100  trials.  P300 
peak  latency  was  computed  with  cross-correlation  (upper  panels).  Woody 
2-iterations  (middle  panels).  Woody  3-iteratlons  (lower  panels).  The  left 
column  shows  waveforms  obtained  with  a  signal-to-nolse  ratio  of  0  (no  P300 
was  present),  the  right  column  shows  waveforms  obtained  with  a 
signal-to-nolse  ratio  of  5.  The  solid  lines  Indicate  Pz  waveforms,  the 
dashed  lines  indicate  Vector  filtered  waveforms.  The  vertical  lines 
indicate  the  latency  of  the  simulated  P300.  The  amplitude  is  expressed  In 
arbitrary  units. 
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Figure  5.  Log  MSE  and  number  of  trials  required  to  obtain  a  standard 
error  of  3  msec,  as  a  function  of  signal-to-noise  ratio  for  four  different 
component  overlap  conditions.  The  results  for  two  detection  algorithms  and 
two  spatial  filtering  techniques  are  shown  for  each  condition. 

Figure  6.  Log  MSE  and  number  of  trials  required  to  obtain  a  standard 
error  of  3  msec,  as  a  function  of  the  amplitude  of  P300  for  the 
non-standardized  noise  condition.  The  results  for  two  detection  algorithms 
and  two  spatial  filtering  techniques  are  shown. 

Figure  7.  Effect  of  P300  duration  on  the  accuracy  of  latency 
estimation  with  peak-picking,  cross-correlation  and  Woody  filter  with  two 
and  three  iterations. 

Figure  8.  Schematic  representation  of  some  of  the  sets  of  the  Vector 
filter  weights  (profiles)  used  for  Simulation  6. 
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Article  20. 

Resource  Reciprocity:  An  Event-Related  Brain  Potentials  Analysis 

Erik  J.  Sirevaag,  Arthur  F.  Kramer,  Michael  G.H.  Coles,  Emanuel  Donchin 
University  of  Illinois,  Department  of  Psychology,  Champaign,  Illinois 

Abstract 

The  amplitude  of  the  P300  component  of  the  Event-Related  Potential  (ERP)  has 
proven  useful  in  identifying  the  resource  requirements  of  complex  perceptual -motor 
tasks.  In  dual-task  conditions,  increases  in  primary  task  difficulty  result  in 
decreases  in  the  amplitude  of  P300s  elicited  by  secondary  tasks.  Furthermore,  P300s 
elicited  by  discrete  primary  task  events  increase  in  amplitude  with  increases  in  the 
difficulty  of  the  primary  task.  The  reciprocity  in  P300  amplitudes  has  been  used  to 
infer  the  processing  tradeoffs  that  occur  during  dual-task  performance.  The  present 
study  was  designed  to  investigate  further  the  P300  amplitude  reciprocity  effect  under 
conditions  in  which  primary  and  secondary  task  ERPs  could  be  concurrently  recorded 
within  the  same  experimental  situation.  Forty  subjects  participated  in  the  study. 
Measures  of  P300  amplitude  and  performance  were  obtained  within  the  context  of  a 
pursuit  step  tracking  task  (the  primary  task)  performed  alone  and  with  a  concurrent 
auditory  discrimination  task  (the  secondary  task).  Primary  task  difficulty  was 
manipulated  by  varying  both  the  number  of  dimensions  to  be  tracked  (from  one  to  two), 
and  the  control  dynamics  of  the  system  (velocity  or  acceleration).  ERPs  were 
obtained  from  both  secondary  task  tones  and  primary  task  step  changes.  Average  root- 
mean-square  (RMS)  error  estimates  were  also  obtained  for  each  tracking  condition. 
Increased  primary  task  difficulty,  reflected  in  increased  RMS  error  scores,  was 
associated  with  decreased  secondary  task  P300  amplitudes  and  increased  primary  task 
P300  amplitudes.  The  increases  in  primary  task  P300  amplitudes  were  complementary  to 
the  decrements  obtained  for  the  secondary  task,  supporting  the  hypothesis  of 
reciprocity  between  primary  and  secondary  task  P300  amplitudes  across  several 
different  manipulations  of  primary  task  difficulty. 

KEYWORDS:  EVENT-RELATEO  8RAIN  POTENTIALS,  P300,  RESOURCES,  MENTAL  WORKLOAD, 

DUAL-TASK  PERFORMANCE 
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This  study  is  concerned  with  examining  the  assumptions  which  underlie 
the  use  of  the  P300  component  of  the  Event-Related  Brain  Potential  (ERP)  as 
a  measure  of  mental  workload.  The  ERP  represents  electroencephalographic 
(EEG)  activity  which  is  time-locked  to  an  event  (Donchin  et  al.  1978).  By 
averaging  over  records  which  follow  repetitions  of  the  event,  the 
contribution  to  the  average  ERP  of  background  activity  unrelated  to  the 
processing  of  the  event  (noise)  diminishes  while  the  contribution  of  the 
time-locked  activity  (signal)  is  enhanced.  The  P300  component  of  the  ERP  is 
represented  by  a  positive  voltage  deflection  maximal  over  the  parietal  scalp 
with  a  minimal  latency  of  300  msec  (Sutton  et  al.  1965). 

The  proposal  that  P300  amplitude  can  serve  a  role  in  the  measurement  of 
mental  workload  derived  from  observations  that  the  amplitude  of  the  P300  is 
severely  reduced  when  a  subject's  attention  is  directed  away  from  the  task 
in  which  the  eliciting  stimuli  are  embedded  (Johnson  and  Donchin  1978,  1982; 
Duncan-Johnson  and  Donchin  1977,  1978).  An  important  question,  however,  was 
whether  P300  amplitude  would  reflect  graded  changes  in  attention  that  are 
characteristics  of  different  levels  of  workload. 

A  systematic  relationship  between  P300  amplitude  and  mental  workload 
was  demonstrated  in  a  series  of  studies  that  employed  dual-task  techniques 
(Brown  1978;  Ogden  et  al.  1979).  In  these  studies,  subjects  performed  two 
tasks  concurrently  (see  Donchin  et  al.  1986  for  an  in  depth  review).  One 
task  was  designated  as  primary,  the  other  as  secondary.  Primary  tasks 
included  both  system  monitoring  and  manual  control.  The  ERP  eliciting 
secondary  tasks  involved  either  visual  or  auditory  discriminations  (Isreal 
et  al.  1980a;  Isreal  et  al.  1980b;  Natani  and  Gomer,  1981;  Kramer  et  al. 
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1983).  Results  of  these  studies  indicated  that  the  amplitudes  of  P300s 
elicited  by  secondary  task  stimuli  decreased  as  the  demands  placed  on 
subjects  by  the  primary  tasks  increased.  Furthermore,  this  relationship 
between  P300  amplitude  and  task  demands  occured  only  when  these  demands  were 
perceptual /cognitive  in  nature.  Motor  demands  did  not  systematically  affect 
the  amplitude  of  the  P300  component. 

The  results  of  these  studies  were  interpreted  within  the  framework  of 
Multiple  Resource  theory  (Kinsbourne  and  Hicks  1978;  Navon  and  Gopher  1979; 
Sanders  1979;  Wickens  1980;  Friedman  and  Poison  1981).  These  models  propose 
that  dual  task  performance  can  be  conceptualized  in  terms  of  a  number  of 
different  resources  that  are  limited  in  quantity.  Tasks  that  require  the 
same  types  of  resources  will  be  timeshared  more  poorly  than  tasks  that 
require  different  resources.  The  allocation  of  this  limited  processing 
commodity  to  the  performance  of  a  given  task  is  determined  by  the  motivation 
and  skill  of  the  operator  and  the  demands  of  the  task.  Workload,  therefore, 
is  viewed  as  a  hypothetical  construct  reflecting  the  interaction  between 
task  demands  and  operator  attributes  (Gopher  and  Donchin  1986). 

When  an  operator  is  in  a  demanding  multi-task  situation,  and  the  tasks 
require  the  same  resources,  tradeoffs  in  performance  will  be  observed.  If 
the  operator  is  instructed  to  optimize  performance  on  one  task,  fewer 
resources  will  be  available  for  the  other  tasks.  Furthermore,  increases  in 
primary  task  difficulty  entail  the  allocation  of  a  larger  share  of  the 
resources  to  the  primary  task.  Inevitably,  these  resources  are  no  longer 
available  to  the  secondary  tasks  and  performance  on  tne  secondary  tasks 
deteriorates  (Navon  and  Gopher  1979;  Norman  and  Bobrow  1975). 

While  the  results  of  the  P300  studies  are  consistent  with  the  Multiple 
Resource  model,  there  was  an  important  implication  that  had  not  been 
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adequately  examined.  We  refer  to  the  fate  of  the  P300  associated  with  the 
primary  task.  If  P300  amplitude  does,  in  fact,  reflect  the  resource 
tradeoffs  that  occur  during  dual-task  performance,  then  P3QQs  elicited  by 
primary  task  events  should  increase  in  amplitude  with  increases  in  the 
workload  of  the  primary  task.  An  even  stronger  prediction  is  that  in  dual¬ 
task  studies  in  which  ERPs  can  be  recorded  in  response  to  discrete  primary 
and  secondary  task  events,  there  should  be  a  reciprocal  relationship  between 
primary  and  secondary  task  P300  amplitudes. 

The  amplitude  reciprocity  hypothesis  was  explicitly  tested  by  Wickens 
et  al.  (1983)  who  required  subjects  to  track  a  discretely  changing  target 
with  a  cursor.  The  ERPs  elicited  by  the  step  changes  of  the  primary  task 
were  recorded  in  one  experimental  run;  while  those  elicited  by  the  tones 
counted  during  the  secondary  task  were  recorded  in  a  separate  session.  Tasx 
demands  were  varied  by  manipulating  the  number  of  time  integrations  between 
the  joystick  output  and  the  movements  of  the  cursor  on  the  screen  (so  that 
the  dynamics  of  the  system  changed  from  velocity  to  acceleration  control). 

The  data  indicated  that  P300s  associated  with  the  step  changes  increased  in 
amplitude  with  increased  primary  task  difficulty;  while  secondary  task  P300 
amplitudes  decreased  in  a  complementary  manner. 

While  these  data  did  confirm  the  basic  prediction  of  the  resource 
reciprocity  hypothesis,  we  considered  it  necessary  to  test  this  hypothesis 
with  ERPs  that  were  elicited  by  primary  and  secondary  task  events  recorded 
within  the  same  block  of  trials.  In  other  words,  the  case  for  amplitude 
reciprocity  would  be  stronger  if  a  reciprocal  relationship  between 
concurrently  recorded  primary  and  secondary  task  ERPs  is  found. 

Furthermore,  no  study  to  date  has  examined  the  relationship  between  task 
demands  and  P300  amplitude  at  the  level  of  the  single  subject.  We  report 
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here  a  study  that  tests  the  P300  amplitude  resource-reciprocity  hypothesis 
using  concurrently  recorded  primary  and  secondary  task  ERPs  both  across 
tasks  and  within  single  subjects. 

Previous  research  has  indicated  that  while  P300  amplitude  is  sensitive 
to  increases  in  the  system  order  of  a  tracking  task  (Kramer  et  al.  1983), 
manipulations  of  the  number  of  dimensions  in  which  a  subject  is  required  to 
track  produce  no  changes  in  secondary  task  P300  amplitude  (Wickens  et  al. 

1977).  Given  that  the  P300  is  sensitive  to  perceptual /cogni ti ve  demands  and 
is  relatively  unaffected  by  motor  demands  (Isreal  et  al.  1980a),  these 
results  imply  that  the  system  order  manipulation  places  a  heavy  load  on 
perceptual/cognitive  processing  while  the  dimensionality  manipulation  loaded 
primarily  on  response  processing.  Therefore,  an  orthogonal  manipulation  of 
dimensionality  and  system  order  should  provide  conditions  with  varying 
degrees  of  primary  and  secondary  task  resource  competition. 

In  the  present  study,  a  step  tracking  task  was  developed  in  which 
subjects  performed  four  conditions  (2  system  orders  x  2  dimensions)  within 
the  context  of  both  single  and  dual-task  instructions  (i.e.  the  presence  or 
absence  of  a  concurrent  auditory  discrimination  task).  This  combination  of 
independent  variables  allows  for  the  replication  of  past  findings  and  at  the 
same  time  introduces  a  previously  unexplored  condition  (two  dimensional 
tracking  in  a  second  order  system).  8ecause  the  present  study  contains  four 
levels  of  the  primary  task,  each  of  which  should  place  different  demands 
upon  the  operator,  it  provides  a  unique  opportunity  to  examine  the 
sensitivity  of  both  primary  and  secondary  task  P300  amplitudes  to  graded 
changes  in  workload-.  Thus,  the  present  study  examines  whether  the 
reciprocity  of  P300  amplitude  can  be  demonstrated  both  across  tasks  and 
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within  subjects  in  a  study  involving  orthogonal  combinations  of  dependent 
variables  and  concurrently  recorded  primary  and  secondary  task  ERPs. 

Method 

Subjects  Forty  dextral  males  between  the  ages  of  18  and  25  were  paid  for 
their  participation  in  this  study.  None  of  the  subjects  had  any  previous 
experience  with  the  step  tracking  task.  All  subjects  had  normal  hearing  and 
normal  or  corrected  to  normal  vision. 

Tasks  As  a  primary  task,  subjects  tracked  a  target  by  moving  a  cursor  on 
the  display  screen.  The  target  and  cursor  were  both  square  (.5  x  .5  cm)  and 
were  displayed  with  equal  intensity  on  a  Cathode  Ray  Tube  (CRT)  1.2  m  from 
the  subject.  Movements  of  the  target  square  were  under  computer  control. 

The  targets  moved  in  discrete  jumps  to  random  positions  on  the  CRT  with  an 
average  inter-mo;e  interval  of  3.8  sec.  The  jumps  could  occur  either  solely 
in  the  horizontal  or  in  both  the  horizontal  and  the  vertical  dimensions 
depending  on  the  experimental  condition.  The  sequence  of  jumps  was 
constrained  so  that  an  equal  number  of  jumps  to  the  left  and  right,  as  well 
as  up  and  down,  were  executed  in  a  given  block.  While  changes  in  the 
spatial  position  of  the  target  occurred  in  discrete  steps,  the  subject  was 
required  to  exercise  continuous  control  over  the  joystick  to  cancel  the 
error  between  the  tracking  elements.  In  each  dual-task  block  the  target 
changed  position  approximately  60  times. 

Subjects  controlled  the  position  of  the  cursor  on  the  screen  by 
manipulating  a  joystick  with  their  right  hand.  The  dynamics  of  the  system 
response  to  movements  of  the  joystick  were  determined  by  the  following 
equation: 
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X(T)=l(l-A)$U(T)  dtl+I(A)|fu(T)  dt | 

where  U=stick  position 
T=t ime 

A=contr ibution  of  the  second  order  system 
The  value  of  "A"  was  varied  to  create  velocity  and  acceleration  conditions. 
When  A=0,  the  system  was  a  pure  first-order  system  in  which  movements  of  the 
stick  increased  or  decreased  the  velocity  with  which  the  cursor  moved.  This 
will  be  referred  to  as  the  velocity  condition.  When  A=95,  the  system 
included  a  second-order  component.  That  is,  the  joystick  controlled  the 
acceleration  of  the  cursor.  In  this  “acceleration"  condition,  it  was 
considerably  more  difficult  to  achieve  control  over  the  cursor's  movements. 

For  the  secondary  task,  the  subjects  monitored  a  Bernoulli  sequence  of 
auditory  stimuli  presented  binaurally  through  TDH-39  headphones.  On  any 
particular  trial  a  high  (1400  Hz)  or  low  (1200  Hz)  pitch  tone  could  occur 
with  an  equal  probability.  The  subjects  task  was  to  count  covertly  the 
total  number  of  one  category  of  the  tones.  Within  a  given  block,  from  26  to 
35  target  tones  were  presented  and  the  interval  between  individual  tones 
averaged  3.8  sec.  The  duration  of  both  tones  was  60  msec  (including  a  10 
msec  rise/fall  time).  Since  we  were  interested  in  ERPs  to  both  step  and 
tone  stimuli,  the  presentation  schedule  for  the  tones  was  constrained  so 
that  the  recording  epochs  for  the  tones  (1280  msec)  and  the  step  changes 
(1280  msec)  did  not  overlap. 

Recording  system  Electroencephalograpnic  (EEG)  activity  was  recorded  from 
three  midline  sites  (Fz,  Cz,  and  Pz  according  to  the  International  10-20 
system:  Jasper  1958)  referenced  to  linked  mastoids..  Two  Beckman 
Biopotential  electrodes  were  attached  to  the  forehead  and  used  as  grounds. 
The  scalp  and  mastoid  electrodes  were  Burden  Ag-AgCl  electrodes  affixed  witn 
collodion.  The  vertical  electrooculogram  (EOG)  was  recorded  from  Beckman 
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electrodes  affixed  with  adhesive  collars  above  and  below  the  subject's  right 
eye.  All  electrode  impedances  were  below  10  kohms. 

The  EEG  and  EOG  were  amplified  by  Van  Gogh  model  50000  amplifiers  with 
a  10  sec  time  constant  and  a  low-pass  filter  of  35  Hz,  3  db/octave  rolloff. 

The  recording  epoch  for  both  the  EEG  and  EOG  was  1280  msec  beginning'  100 
msec  prior  to  either  the  primary  task  step  changes  or  the  secondary  task 
tones.  The  data  channels  were  digitized  every  10  msec  and  were  also 
filtered  off-line  (-3  db  at  6.29  Hz,  0  db  at  14.27  Hz)  prior  to  further 
analysis. 

Stimulus  generation  and  data  collection  Presentation  of  the  stimuli  and 
collection  of  the  data  were  under  the  control  of  a  PDP  11/40  computer  (see 
Donchin  and  Heffley  1975).  On  line  monitoring  of  both  average  and  single 
trial  EEG  and  EOG  was  accomplished  by  a  GT-44  display.  Contributions  of  the 
EOG  to  the  EEG  waveforms  were  evaluated  and  eliminated  off-line  by 
submitting  the  data  to  the  eye-movement  correction  algorithm  developed  by 
Gratton  et  al.  (1983). 

Procedure  Two  aspects  of  the  tracking  task  could  vary.  The  target  could 
move  in  one  or  two  dimensions.  The  control  system  could  be  either  a 
"velocity"  or  an  "acceleration"  system.  Thus,  four  different  formats  of  the 
tracking  task  were  obtained  by  crossing  dimensionality  with  control 
dynamics.  These  tasks  could  be  performed  either  alone  (single  task 
conditions),  or  concurrently  with  the  auditory  discrimination  task  (dual¬ 
task  conditions).  Each  of  the  forty  subjects  participated  in  all  eight 
experimental  conditions.  Each  condition  lasted  approximately  5  minutes  and 
was  followed  by  a  short  (2  min)  break. 

Following  electrode  placement,  subjects  were  told  that  they  were  about 
to  participate  in  a  study  to  assess  the  effects  of  task  difficulty  on  single 
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and  dual-task  performance.  Before  receiving  practice  in  the  step  tracking 
task,  all  subjects  performed  three  blocks  of  the  auditory  discrimination 
task  to  familiarize  them  with  the  stimuli  and  procedure.  Subjects  then 
performed  three  single  task  tracking  blocks  as  practice.  The  practice 
trials  consisted  of  one  block  of  velocity  tracking  in  two  dimensions,  and 
two  blocks  of  acceleration  tracking  (one  in  one  dimension  and  one  in  two 
dimensions).  After  completing  the  practice  blocks,  subjects  were  instructed 
to  assign  the  tracking  task  top  priority.  Thus,  tracking  was  defined  as  the 
primary  task.  The  subjects  were  told  that  while  they  should  try  to  count 
tones  accurately,  their  goal  was  to  perform  the  tracking  task  as  well  as 
possible. 

This  study  was  part  of  a  much  larger  research  effort  that  examined 
individual  differences  related  to  the  performance  of  perceptual /motor  tasks. 

For  this  reason,  it  was  necessary  to  insure  that  subjects  were  treated 
identically.  This  required  us  to  run  the  subjects  through  the  experimental 
conditions  in  a  fixed  order  rather  than  in  a  counterbalanced  fashion.  The 
order  of  the  experimental  blocks  is  presented  in  table  1. 

<Insert  table  1  about  here> 

Note  that  single  task  blocks  always  preceded  dual-task  blocks,  and  easier 
tracking  conditions  preceded  difficult  tracking  conditions.  While  this 
order  may  have  diminished  differences  between  single  and  dual-task 
performance  as  a  result  of  practice,  this  is  not  the  comparison  of  primary 
concern  in  this  study.  With  respect  to  the  comparison  that  is  of  interest 
(ie  between  dual-task  blocks  differing  in  dimensionality  and  system  order) 
any  learning  due  to  practice  effects  should  improve  performance  during  the 
more  difficult  conditions,  rather  than  enhance  performance  decrements  due  to 
increased  primary  task  difficulty.  Thus,  this  conservative  design  will  tend 
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to  diminish  differences  between  between  easy  and  difficult  tracking 
conditions  due  to  practice  effects  rather  than  amplify  them. 

Another  potential  confound  introduced  by  the  fixed  presentation  order 
is  a  possible  fatigue  effect.  Because  the  difficult  conditions  were 
presented  later  in  the  session,  any  performance  decrements  could  be  the 
result  of  increased  fatigue.  We  consider  this  explanation  unlikely  given 
that  the  experimental  session  rarely  lasted  for  more  than  one  hour  and 
subjects  were  given  the  opportunity  to  take  breaks  if  so  desired. 

Furthermore,  it  is  difficult  to  see  how  fatigue  could  artifactually  produce 
reciprocity.  Although  one  could  argue  that  secondary  task  P300  amplitude 
decrements  were  due  to  fatigue,  this  does  not  account  for  concomitant 
increases  in  the  amplitude  of  primary  task  P300s. 

Tracking  accuracy  data  were  collected  by  recording,  every  50  msec,  the 
distance  between  the  subject  controlled  cursor  and  the  target  square  and 
then  computing  the  root-mean-square  (RMS)  error  defined  by  these  values. 
Accuracy  in  the  auditory  discrimination  task  was  assessed  by  comparing  the 
subject's  count  of  target  tones  with  the  actual  number  of  tones  presented. 

Results  and  Discussion 

The  primary  and  secondary  task  performance  data  will  first  be  examined 
to  assess  the  extent  to  which  the  variations  in  dimensionality  and  system 
order  modulated  the  performance  of  the  primary  task.  The  RMS  error  measures 
will  be  used  to  define  the  difficulty  level  of  the  experimental  conditions. 
According  to  this  operational  definition,  the  claim  that  a  manipulation  has 
increased  the  difficulty  of  the  primary  task  will  be  made  if,  and  only  if, 
an  Increase  in  RMS  error  was  produced.  The  RMS  data  will  also  be  examined 
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to  determine  whether  subjects  did,  in  fact,  protect  the  level  of  performance 
on  the  primary  task  even  as  the  task  demands  increased.  This  is  an 
important  observation  for  secondary  task  performance  decrements  are 
difficult  to  intrepret  if  subjects  do  not  perform  the  primary  task 
comparably  during  both  single  and  dual-task  conditions. 

With  these  observations  established,  the  ERPs  elicited  during  the  dual¬ 
tasks  can  be  analyzed  to  assess  the  effects  of  increased  primary  task 
difficulty  on  the  amplitude  of  P300s  associated  with  the  secondary  task 
events.  Recall  that  with  increases  in  the  difficulty  of  the  primary 
tracking  task  we  predicted  that  (a)  increases  in  primary  task  P300s  would  be 
observed,  and  (b)  decreases  in  secondary  task  P300  amplitudes  would  be 
obtained.  Thus,  the  P300s  associated  with  the  primary  task  step  changes 
and  the  secondary  task  tones  will  be  evaluated  to  test  this  prediction. 

The  reciprocity  of  primary  and  secondary  task  P300  amplitudes  will  be 
analyzed  both  across  and  within  individual  subjects. 

Performance  data  The  average  root  mean  square  (RMS)  error  for  an 
experimental  condition  reflects  the  average  distance  between  the  cursor  and 
the  target  square.  Low  values  of  the  RMS  error  scores,  therefore,  reflect 
increased  tracking  accuracy.  To  facilitate  the  comparison  of  performance 
and  ERP  data,  the  RMS  data  were  range-corrected  according  to  the  following 
transformation: 

X(I)  -  X(MIN) 

X ( T)  =  100  *  - 

X(RNG) 

where,  X(T)=  transformed  score; 

X(I)=  score  obtained  in  a  given  condition; 

X(MIN)=  minimum  score  for  a  given  subject; 

X(RNG)=  range  of  scores  for  a  given  subject. 
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In  table  2  we  present  the  values  of  the  mean  range-corrected  RMS  error 
scores  for  all  single  and  dual-task  tracking  conditions. 

<Insert  table  2  about  here> 

It  is  evident  that  the  introduction  of  the  secondary  task  did  not  impair 
performance  of  the  primary  task.  The  RMS  error  scores  during  dual-tasks 
were  not  statistically  different  (£(  1, 39 ) *3 . 89 ,  p>.G5|  than  the  RMS  errors 
during  single  task  tracking.  The  same  pattern  of  results  was  also  obtained 
with  the  uncorrected  scores  indicating  that  our  results  were  not 
artifactual ly  produced  by  the  range-correction  procedure.  Indeed, 
differences  between  single  and  dual-task  RMS  error  scores  involved  less  than 
two  percent  of  the  total  variance  for  both  the  corrected  and  uncorrected 
measures. 

Because  the  RMS  error  data  were  compared  with  ERP  data  collected  only 
during  dual-task  conditions,  the  RMS  data  from  the  dual-task  conditions 
alone  were  submitted  to  range  correction  and  the  effects  of  the 
dimensionality  and  order  manipulations  were  examined  with  respect  to  this 
data  base.  Tracking  accuracy  declined,  that  is  error  rate  increased,  as 
dimensionality  increased  |F(l,39)=321.8i;  p< .01 ] ,  and  as  the  control  order 
was  increased  from  a  velocity  to  an  acceleration  system  [£(1 ,39)=2246.03; 

p< .0 1  ] . 

The  effect  of  system  order  was  consistently  larger  than  the  effect  of 
dimensionality.  This  is  illustrated  by  the  significant  interaction  between 
system  order  and  dimensionality  [F(l,39)=132.17;  p< . 01 ) .  Tukey  tests 
(Tukey  1977)  performed  on  pairwise  comparisons  indicate  that  order 
significantly  affected  performance  in  both  one  [F(l,39)=45.30;  p<.01]  and 
two  dimensions  (F(l,39)=61.40;  p< . 01 1 .  Similarly,  the  effect  of 
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dimensionality  was  significant  for  both  velocity  [F(l,39)=202.21;  p< . 0 1 1 , 
and  acceleration  [F{1,39)=4196.28;  p< . 01 ]  control  systems. 

Also  presented  in  table  2  are  the  mean  number  of  counting  errors  during 
the  auditory  discrimination  task.  While  analyses  of  these  data  indicated 
that  counting  performance  was  signif icantly  impaired  by  the  manipulation  of 
system  order  (£(1,39)=9.32;  p< . 004 | ,  the  magnitude  of  this  effect  was  quite 
small.  Because  the  average  number  of  errors  in  even  the  most  difficult 
condition  was  less  than  two,  we  assume  that  changes  in  the  ERPs  associated 
with  the  secondary  task  as  a  function  of  increased  primary  task  difficulty 
reflect  a  reallocation  of  resources  to  the  primary  task  rather  than  an 
unacceptable  level  of  secondary  task  performance. 

In  sumnary,  the  RMS  error  data  indicate  that  the  manipulations  of 
control  order  and  dimensionality  successfully  produced  a  range  in  tracking 
performance  suitable  for  the  analysis  of  P30Q  amplitude  reciprocity  under 
varying  levels  of  primary  and  secondary  task  competition  for  processing 
resources.  Furthermore,  the  RMS  data  confirm  that  subjects  protected  their 
performance  on  the  primary  task,  for  there  was  no  significant  increase  in 
RMS  scores  due  to  the  imposition  of  the  secondary  task.  Finally,  the  small 
number  of  counting  errors  in  all  dual-task  blocks  provides  support  for  the 
claim  that  changes  in  the  secondary  task  waveforms  cannot  be  explained 
simply  as  the  result  of  inadequate  performance  of  the  auditory 
discrimination  task. 

Secondary  task  ERP  data  The  secondary  task  ERP  data  will  now  be  examined  to 
determine  the  extent  to  which  variations  in  primary  task  workload  (as 
reflected  in  the  RMS  error  scores)  are  manifested  in  P300  amplitude 
variability  associated  with  the  different  secondary  task  conditions. 
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The  grand  average  target  ERPs  for  each  of  the  dual-task  conditions  are 
displayed  in  fig.  1.  As  predicted,  the  one-dimensional  velocity  condition 
was  associated  with  the  largest  secondary  task  P300,  and  the  smallest 
secondary  task  P300  was  elicited  during  the  most  difficult  two-dimensional 
acceleration  condition. 

<Insert  fig.  1  about  here> 

A  Principle  Components  Analysis  (PCA)  was  conducted  on  the  secondary  task 
ERP  data  to  quantify  the  effects  of  manipulations  of  system  order  and 
dimensionality  upon  the  P300  (see  Donchin  and  Heffley  1978;  Coles  et  al. 

1986,  for  a  discussion  of  this  procedure).  The  main  advantage  of  this 
technique  is  its  ability  to  separate  temporally  overlapping  components.  A 
data  matrix  consisting  of  960  trials  (40  Subjects  x  2  Dimensions  x  2  Control 
Orders  x  2  Stimulus  Categories  x  3  Electrodes]  was  submitted  to  the  PCA. 

The  grand  mean  waveform  and  the  component  loadings  derived  from  the  PCA  are 
displayed  in  fig.  2. 

<Insert  fig.  2  about  here> 

It  has  been  suggested  (Donchin  et  al.  1978;  Donchin  et  al .  1986)  that 
ERP  components  be  identified  according  to  three  criteria:  their  latency 
relative  to  a  stimulus  or  a  response;  their  amplitude  distribution  across 
different  electrode  sites;  and  their  sensitivity  to  task  manipulations. 

Utilizing  these  criteria,  component  5  can  be  identified  as  the  P300 
component  (it  has  the  correct  latency  and  scalp  distribution  and  displays  a 
significant  target  effect  ( F_(l ,39)  =8 .84;  p < . 00 5 1 . 

Having  established  component  5  as  representative  of  the  P300  we  will 
now  examine  the  effects  of  the  experimental  manipulations  of  dimensionality 
and  system  order  upon  the  amplitude  of  this  component.  The  requirement  to 
track  in  two  dimensions  signif icantly  reduced  the  amplitude  of  the  P300s 


.At 
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associated  with  the  concurrent  auditory  discrimination  task  [£(1 ,39)=30.84; 
p< .000 1 1  when  compared  with  one  dimensional  tracking  conditions.  Similar 
results  were  obtained  for  the  system  order  manipulation.  Thus,  smaller 
secondary  task  P30Qs  were  produced  during  acceleration  tracking  conditions 
than  when  the  tracking  task  required  velocity  control  |F(1,39)=21.49; 

p<.0001) . 

The  component  scores  for  this  factor  at  Cz  were  used  to  provide 
numerical  estimates  of  P300  amplitude  for  each  individual  subject  in  all  of 
the  dual-task  tracking  conditions.  These  amplitude  estimates  were  obtained 
so  that  variations  in  secondary  task  P300  amplitude  could  be  directly 
compared  with  primary  task  P300  amplitude  variability  both  within  and  across 
subjects.  Since  the  results  obtained  from  separate  PCAs  cannot  be  directly 
compared,  the  component  scores  were  submitted  to  the  range  correction 
algorithm  outlined  above  for  the  RMS  error  scores. 

The  mean  range-corrected  P300  component  scores  for  the  secondary  task 
are  presented  in  table  3.  Note  that  higher  component  scores  reflect 
increased  P300  amplitude. 

<Insert  table  3  about  here> 

The  analysis  of  variance  conducted  upon  the  range-corrected  scores  confirms 
that  the  manipulations  of  dimensionality  1F(1,39)=13.15;  p< . 0008 1  and  system 
order  |£(1,39)=18.27;  p< .000 1 ]  significantly  reduced  secondary  task  P300 
amplitude  with  no  sign'ricant  interaction  [£(1,39)=0.42;  p=0.52|.  Given  the 
RMS  error  results,  these  data  confirm  that  increased  primary  task  workload, 
in  every  case,  was  associated  with  a  reduction  in  the  amplitude  of  P300s 
generated  by  the  concurrently  performed  auditory  discrimination  task, 
confirming  the  previous  studies  (Isreal  et  al.  1980a;  Kramer  et  al .  1983; 

Natani  and  Corner,  1981;  Kramer  et  al .  1987). 
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Primary  Task  ERPs  The  ERPs  elicited  by  the  step  changes  in  the  various 
dual-task  tracking  conditions  are  displayed  in  fig.  3. 

<Insert  fig.  3  about  here> 

The  ERP  pattern  in  this  condition  is  quite  different  than  that  recorded  in 
response  to  the  secondary  task.  These  waveforms  are  dominated  by  a  large 
deflection  that  is  maximal  at  the  central  electrode  (C2).  It  is  evident 
that  the  scalp  distribution  and  the  early  latency  of  the  peak  suggest  that 
this  positive  deflection  is  not  a  P30Q.  We  conclude,  rather,  that  P300  in 
these  waveforms  overlaps  with  this  component  producing  differential  returns 
to  baseline  for  the  different  conditions.  It  is  also  evident  that  the 
amplitude  of  the  P300  component  in  all  of  the  conditions  was  quite  small. 

This  is  not  surprising  given  that  the  primary  task  step  changes  were  all 
equiprobable. 

Fig.  4  displays  the  effect  of  system  order  upon  the  primary  task 
waveforms  in  both  one  and  two  dimensions. 

<Insert  fig.  4  about  here> 

The  cross-hatched  areas  indicate  regions  of  increased  positivity  associated 
with  increased  system  order.  However,  the  differences  evident  in  the 
superaverages  are  small,  presumably  due  to  overlap  with  the  earlier  Cz 
maximal  component.  A  procedure  similar  to  the  one  outlined  for  the  analysis 
of  secondary  task  P300  amplitude  was  followed  for  the  analysis  of  the 
primary  task  ERP  data. 

A  PCA  was  performed  on  the  waveforms  associated  with  both  single  and 
dual -task  step  changes.  The  data  matrix  submitted  to  the  PCA  consisted  of 
960  trials  (40  Subjects  x  2  Task  levels  x  2  Dimensions  x  2  Control  Orders  x 
3  electrodes),  and  four  of  the  components  extracted  were  Varimax  rotated. 
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The  component  structure  extracted  by  this  PCA  is  displayed  in  fig.  5. 

<Insert  fig.  5  about  here> 

Component  1  is  active  in  the  appropriate  latency  range,  and  with  the  correct 
scalp  distribution  (F(2,78)=219.07;  p<. 00011  to  enable  its  identification  as 
the  component  corresponding  to  P300. 

Overall,  primary  task  P300  amplitude  increased  both  as  a  function  of 
increasing  the  number  of  dimensions  [F(l,39)=6.20;  p<.05|  as  well  increasing 
the  control  order  [F(l,39)=33.32;  p< . 00 1 1  of  the  tracking  task  with  no 
significant  interaction.  Furthermore,  both  the  dimension  and  order  effects 
interacted  with  electrode  site  such  that  modulation  of  the  component  was 
greater  at  Cz  than  Fz  or  Pz  (F(2,78)=  7.13;  p< .01;  and  £(2,78)=28. 13;  p<-01, 
respectively) . 

Numerical  estimates  of  primary  task  P300  amplitude  were  obtained  by 
extracting,  for  every  subject,  the  component  scores  at  Cz  output  by  the  PCm 
outlined  above  for  the  various  dual-task  tracking  conditions.  The  mean 
component  scores  are  presented  in  table  3.  Once  again,  to  facilitate 
amplitude  comparisons  between  the  primary  and  secondary  task  P300s,  the 
measures  were  corrected  for  range.  An  examination  of  table  3  reveals  that 
as  the  difficulty  of  the  primary  task  increased  (as  reflected  in  increased 
RMS  error  scores  and  decreased  secondary  task  P300  amplitudes),  the 
amplitude  of  the  P300s  associated  with  primary  task  events  also  increased. 

An  analysis  of  variance  conducted  on  the  range-corrected  estimates  of 
primary  task  P300  amplitude  confirms  that  larger  P300s  were  associated  with 
two  dimensional  tracking  conditions  [F(l/39)=5.45;  p<.02|,  as  well  as  with 
conditions  requiring  acceleration  control  [F(l/39)=28.57;  p<.0001).  The 
dimension  by  order  interaction  was  not  significant  [ £_(  1/ 39 )  =  1 . 62 ;  p=0. 20 ) . 
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Thus,  decreases  in  the  amplitude  of  secondary  task  P300s  were  accompanied  in 
every  case  by  increased  primary  task  P300  amplitudes. 

Combined  Analysis  The  preceding  analysis  can  be  criticized  on  the  grounds 
that  the  P300  amplitude  estimates  for  the  primary  and  secondary  tasks  were 
assessed  using  different  PCAs.  Therefore,  an  additional  analysis  was 
conducted  in  which  a  single  PCA  was  performed  on  the  waveforms  from  the 
concurrently  performed  primary  visual  and  secondary  auditory  tasks.  Thus,  a 
data  matrix  consisting  of  960  trials  (40  Subjects  x  2  Task  levels  x  2 
Dimensions  x  2  Control  orders  x  3  Electrodes),  was  submitted  to  a  PCA  in 
which  5  factors  were  extracted  and  Varimax  rotated.  The  component  structure 
associated  with  this  PCA  is  presented  in  fig.  6. 

<Insert  fig.  6  about  here> 

Interpretation  of  the  Anova  conducted  on  the  output  of  this  PCA  was 
complicated  by  the  fact  that  the  primary  and  secondary  tasks  required 
subjects  to  process  information  presented  in  different  modalities.  As  a 
result,  the  waveforms  associated  with  the  visual  primary  task  differed 
considerably  from  the  waveforms  of  the  auditory  secondary  task.  For 
example,  the  P300  component  of  the  primary  task  was  evident  at  a  much  longer 
latency  than  the  P30Q  component  of  the  secondary  task.  Additionally,  the 
structure  of  the  components  surrounding  the  P300  was  different  for  the  two 
tasks. 

Because  of  the  differences  in  latency  and  component  structure,  the 
P300s  for  the  primary  and  secondary  task  emerged  as  two  separate  components 
in  this  analysis.  Since  component  2  has  a  parietal  maximum  scalp 
distribution  and  is  active  in  the  appropriate  latency  range  we  identify  it 
as  the  P300  component  for  the  primary  task.  Component  5  meets  these 
criteria  for  the  secondary  task,  and  is  therefore  identified  as  the  the  P300 
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for  the  auditory  task.  The  procedure  described  above  was  applied  to  obtain 
numerical  estimates  of  P300  amplitude.  Thus,  the  relevant  component  scores 
at  Cz  for  each  condition  were  extracted  and  range  corrected  for  every 
subject.  The  mean  estimates  of  P300  amplitude  averaged  across  subjects  are 
presented  In  table  4. 

^Insert  table  4  about  here> 

It  is  apparent  that  the  general  trends  evident  in  table  3  are  also  present 
in  table  4,  for  the  primary  and  secondary  task  P300s  are  ordered  in 
precisely  the  same  way. 

The  estimates  of  P300  amplitude  were  then  submitted  to  a  repeated 
measures  analysis  of  variance.  For  the  primary  task,  P300  amplitude  was 
greater  during  conditions  requiring  acceleration  control  than  velocity 
control  [F(l,39)=35.58;  p< .0001 1 .  This  effect  did  not  interact  with  the 
dimensionality  manipulation  [F(l,39)  =  1.56;  p> .  20  ] .  The  overall  effect  of 
increasing  the  number  of  dimensions  to  be  tracked  did  not  reach  statistical 
significance  [£(1,39)=3.88;  p=.056|.  Analysis  of  the  simple  main  effects 
indicated  that,  although  P300  amplitude  did  not  increase  as  a  function  of 
increased  dimensionality  for  velocity  systems  [F(l,39)=0.38;  p> . 50 ) ,  it  did 
increase  as  dimensionality  increased  for  acceleration  systems  [F(l,39)=5.52; 
p< . 025 1 .  These  results  closely  parallel  those  obtained  when  the  waveforms 
associated  with  the  step  changes  were  analyzed  separately. 

Secondary  task  P3Q0s  revealed  a  reciprocal  pattern  of  results  with 
respect  to  the  primary  task  changes.  Secondary  task  P300s  were  larger 
during  velocity  tracking  conditions  1F(1,39)=13.48;  p< .001 1 ;  as  well  as 
those  involving  one  dimensional  tracking  |F(1,39)=26.39;  p< . 0001 1 .  The 
dimension  by  order  interaction  was  not  significant  [£jl,39)=0. 33;  p=.57|. 

These  results  are  identical  to  those  obtained  when  secondary  task  P3Q0 
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amplitudes  were  analyzed  in  isolation  from  the  ERPs  associated  with  the  step 
changes. 

Reciprocity  An  important  goal  of  this  study  was  to  assess  the  degree  to 
which  P300  amplitudes  associated  with  the  two  tasks  would  be  reciprocal.  In 
fig.  7  the  amplitude  of  the  P300  components  in  the  primary  and  secondary 
tasks  extracted  from  table  3  are  plotted  as  a  function  of  the  RMS  error 
scores  for  each  of  the  dual-task  conditions. 

<Insert  fig.  7  about  here> 

Inspection  of  the  amplitude  estimates  in  table  4  reveals  that  the 
conclusions  would  remain  the  same  had  the  results  of  the  combined  analysis 
been  used  instead.  The  line  at  the  top  of  fig.  7  represents  the  sum  of  the 
primary  and  secondary  task  P300  amplitudes.  Perfect  amplitude  reciprocity 
would  generate  a  function  with  a  slope  of  zero  and  an  intercept  value  of 
100.  As  can  readily  be  seen  by  examining  the  obtained  function,  the 
evidence  for  amplitude  reciprocity  is  quite  good.  Difficult  tracking 
conditions  produced  a  demand  for  perceptual  resources  resulting  in  increased 
primary  task  P300s  and  decreased  secondary  task  P300s.  Furthermore,  the 
greater  the  increase  in  primary  task  P300  the  greater  the  decrease  in 
secondary  task  P300.  This  experiment,  therefore,  provides  the  first 
evidence  for  amplitude  reciprocity  obtained  from  concurrently  recorded 
primary  and  secondary  tasks  of  different  modalities. 

To  determine  the  extent  to  which  this  pattern  of  reciprocity  held  true 
within  subjects,  separate  reciprocity  functions  were  obtained  for  each 
subject  and  the  regression  lines  for  these  functions  were  computed.  If  the 
single  subjects  also  demonstrated  significant  reciprocity  the  mean  slope  of 
these  derived  functions  should  equal  zero  and  the  mean  intercept  should 
equal  100.  These  data  are  presented  in  table  5.  Although  there  was 
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significant  variability  within  the  subjects  (indicating  the  presence  of 
instances  of  both  under  and  over  reciprocity)  the  obtained  value  of  0.04  for 
the  mean  slope  did  not  differ  significantly  from  the  predicted  value  of  0 
(t=0.11,p>0. 10);  and  the  mean  intercept  value  of  92.99  did  not  differ 
significantly  from  the  predicted  value  of  100  (t=1.57,  p>0.05).  An 
additional  within  subjects  analysis  was  also  performed  to  test  the 
reciprocity  hypothesis.  When  secondary  task  P30Q  amplitude  was  plotted  as  a 
function  of  primary  task  P300  amplitude,  a  negative  slope  for  this  function 
was  obtained  for  29  out  of  the  40  subjects.  For  the  11  subjects  with  . 
positive  slopes,  the  value  of  the  slope  was  generally  close  to  0  indicating 
that  their  P300s  did  not  vary  systematically  as  a  function  of  the 
experimental  manipulations.  Thus,  evidence  in  support  of  the  reciprocity 
theory  was  obtained  both  across  and  within  subjects. 

<Insert  table  5  about  here> 

Conclusions 

This  experiment  confirms  the  existence  of  a  reciprocal  relationship 
between  the  amplitudes  of  the  P300s  associated  with  two  concurrently 
performed  tasks.  The  prediction  of  reciprocity  derives  from  a  large  body  of 
evidence  which  has  indicated  that  variations  in  P300  amplitude  are  sensitive 
to  the  manner  in  which  subjects  allocate  processing  resources  between  two 
tasks  under  dual-task  conditions.  In  other  words,  P300  amplitude  has 
emerged  as  a  psychophysiological  metric  of  the  resource  tradeoffs  that  are 
presumed  to  underlie  the  concept  of  mental  workload  (Kahneman  1973;  Navon 
and  Gopher  1979;  Wickens  1980,  1984;  Gopher  and  Donchin  1986). 
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The  RMS  error  data  confirm  that  the  orthogonal  manipulation  of  system 
order  and  dimensionality  employed  in  this  study  successfully  produced  a  wide 
variability  in  performance  within  which  to  assess  the  reciprocity  of  primary 
and  secondary  task  P300  amplitudes.  Because  the  difficulty  of  the  secondary 
task  was  held  constant  during  all  the  step-tracking  conditions,  the  model  of 
resource  reciprocity  upon  which  this  experiment  is  based  predicts  that  as 
the  tracking  task  is  made  more  difficult,  primary  task  P300  amplitudes 
should  become  larger,  due  to  the  allocation  of  additional  processing 
resources;  and  secondary  task  P3Q0  amplitudes  should  decline  as  a  result  of 
the  drain  upon  this  limited  commodity. 

The  data  collected  during  this  experiment  confirm  this  assertion.  As 
the  complexity  of  the  primary  task  increased,  the  RMS  error  measures  also 
increased.  Furthermore,  the  amplitude  of  the  P3Q0s  associated  with  primary 
task  step  changes  increased,  while  the  amplitude  of  the  secondary  task  P300s 
elicited  by  the  auditory  stimuli  decreased  in  the  predicted  fashion.  This 
result  was  obtained  when  amplitude  measures  were  derived  from  both 
individual  and  combined  PCAs.  In  all  conditions,  the  increase  in  primary 
task  P300  amplitude  was  proportional  to  the  decrease  in  secondary  task  P300 
amplitude.  An  examination  of  fig.  7  confirms  that  the  summation  of  primary 
and  secondary  task  P300  amplitudes  yields  an  approximately  constant  value. 

The  validation  of  P300  amplitude  as  a  metric  of  a  particular  aspect  of 
the  workload  demands  of  a  task  has  a  number  of  theoretical  and  applied 
implications.  As  mentioned  earlier,  the  auditory  discrimination  task  is  an 
attractive  secondary  task  for  a  number  of  reasons.  ,The  most  important  of 
these  reasons  is  that  such  a  task  can  be  applied  in  a  relatively  un¬ 
obtrusive  fashion  in  many  different  situations  because  there  is  no  need  for 
an  overt  response.  Thus,  because  subjects  can  count  the  stimuli  rather  than 
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respond  overtly  to  them,  competition  for  response  related  processing 
resources  is  reduced  and  there  are  no  structural  effects  to  impede  the 
performance  of  the  primary  task.  The  RMS  error  data  from  this  experiment 
confirm  that,  indeed,. a  secondary  auditory  discrimination  task  can  be 
imposed  in  a  dual-task  setting  with  minimal  cost  to  the  performance  of  the 
primary  task. 

Another  advantage  of  the  discrimination  task  is  that  stimuli  of 
different  modalities  can  be  used  to  elicit  P300s.  The  modality  of  the 
secondary  task  can,  therefore,  be  chosen  to  eliminate  competition  for 
modality  specific  processing  resources.  In  this  experiment,  an  auditory 
secondary  task  was  chosen  because  the  step-tracking  task  required  visual 
stimulus  processing.  Had  the  primary  task  relied  more  upon  auditory 
processing,  a  visual  secondary  task  could  have  been  employed. 

For  these  reasons,  P300  amplitude  measured  under  dual -task  conditions 
can  be  used  in  the  analysis  of  demands  placed  upon  operators  in  complex  man- 
machine  systems.  The  P300  is.  a  relatively  unobtrusive  measure  sensitive  to 
graded  changes  in  task  difficulty.  Furthermore,  the  P300  is  diagnostic  of 
perceptual /cognitive  as  opposed  to  response-related  processing  (Isreal  et 
al .  1980a;  Isreal  et  al.  1980b).  Finally,  it  is  conceivable  that  with 
further  refinements,  such  as  the  application  of  step-wise  discriminant 
analysis  techniques  (Donchin  and  Herning  1975),  the  bandwidth  and 
reliability  of  the  P300  may  be  of  sufficient  quality  to  permit  the  analysis 
of  workload  on  a  moment  to  moment  basis. 

In  addition  to  validating  the  prediction  of  P300  amplitude  reciprocity, 
this  experiment  produced  a  number  of  other  important  results.  In 
particular,  previous  findings  concerning  the  nature  of  the  manipulation  of 
system  order  obtained  in  this  laboratory  have  been  both  replicated  and 
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extended.  Thus,  the  conclusion  by  Wickens  et  al.  (1983)  that  the 
manipulation  of  system  order  during  a  one-dimensional  tracking  task  produces 
a  salient  drain  on  perceptual/cognitive  processing  resources  has  been 
confirmed  and  extended  to  the  two  dimensional  case.  Secondary  task  P300s 
declined  and  primary  task  P300s  increased  in  amplitude  as  a  function  of 
increased  system  order  in  both  one  and  two  dimension. 

However,  Wickens  et  al.  (1977)  reported  that  changing  the 
dimensionality  of  tracking  has  no  effect  on  P3Q0  amplitude  in  velocity 
systems.  This  finding  was  not  replicated  by  our  study.  Primary  and 
secondary  task  P300  amplitude  did  significantly  vary  as  a  function  of  this 
manipulation  when  subjects  were  tracking  with  a  velocity  control  system. 
However,  Wickens  et  al .  used  a  different  paradigm  where  the  primary  task 
consisted  of  a  compensatory  tracking  task,  while  the  present  study  utilized 
a  pursuit  step  tracking  task.  Thus,  the  nature  of  the  dimensionality 
manipulation  was  different  in  the  two  paradigms.  This  difference  may 
account  for  the  discrepant  results.  Secondly,  it  should  be  recalled  that 
the  magnitude  of  the  dimensionality  manipulation  in  velocity  systems 
produced  the  smallest  changes  in  RMS  error  and  P300  amplitude  of  any  of  the 
manipulations  employed.  Thus,  although  this  manipulation  produced  a 
significant  change  in  workload,  the  magnitude  of  this  change  was  quite 
small.  It  is  conceivable  that  the  increased  power  resulting  from  an 
experiment  involving  40  subjects  allowed  us  to  detect  this  small  effect 
where  the  earlier  study  by  Wickens  et  al .  failed.  Furthermore,  this  study 
provides  the  first  evidence  that  an  increase  in  d jmenional ity  entails  a 
significant  drain  on  perceptual/cognitive  resources  during  acceleration 
tracking  (a  previously  unexplored  condition). 
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In  conclusion,  using  concurrently  recorded  ERPs  associated  with  primary 
and  secondary  tasks,  this  experiment  demonstrates  that  there  is  a  reciprocal 
relationship  between  the  amplitudes  of  the  P3Q0s  associated  with  the  two 
tasks.  Furthermore,  this  relationship  was  investigated  and  confirmed  under 
a  variety  of  levels  of  primary  and  secondary  task  competition  for  processing 
resources.  The  results  can  be  interpreted  within  a  model  of  dual-task 
performance  in  which  the  allocation  of  processing  resources  to  the  two  tasks 
was  presumed  to  determine  primary  and  secondary  task  P300  amplitude.  Thus, 
because  a  reciprocal  relationship  between  the  allocation  of  processing 
resources  to  the  two  tasks  was  presumed  to  exist,  a  reciprocal  relationship 
between  primary  and  secondary  task  P300  amplitudes  was  predicted.  This 
prediction  of  primary  and  secondary  task  P300  amplitude  reciprocity  was 
confirmed  in  all  of  the  conditions  in  which  it  was  tested.  Additionally, 
the  zero  slope  of  the  derived  reciprocity  function  is  evidence  that  the 
total  supply  of  resources  available  for  allocation  to  the  primary  or  the 
secondary  task  remained  relatively  constant  for  all  the  tracking  conditions 
employed  in  this  experiment.  Thus,  this  experiment  further  illustrates  the 
utility  of  the  P300  as  a  tool  to  aid  in  the  analysis  of  mental  workload. 
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Table  1 

Order  of  task  presentation 


VELOCITY  IN  ACCELERATION  IN  VELOCITY  IN  ACCELERATION  IN 
ONE  DIMENSION  ONE  DIMENSION  TWO  DIMENSIONS  TWO  DIMENSIONS 


SINGLE 

DUAL 

SINGLE 

DUAL 

SINGLE 

OUAL 

SINGLE 

DUAL 


1 

2 


5 

6 


3 

4 


7 

8 
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Table  2 

RMS  error  and  counting  performance  data 
Standard  Deviations  are  enclosed  in  parentheses 


VELOCITY  IN  ACCELERATION  IN  VELOCITY  IN  ACCELERATION  IN 
ONE  DIMENSION  ONE  DIMENSION  TWO  DIMENSIONS  TWO  DIMENSIONS 


SINGLE  TASK 
RMS  ERROR 

4.30  (4.09) 

41.90 

(15.23) 

5.70  (7.01) 

91.05  (11.32) 

dual-task 

RMS  ERROR 

2.25  (4.32) 

46.33 

(18.81) 

6.75  (7.64) 

95.92  (6.22) 

COUNTING 

ERRORS 

0.85  (0.85) 

1.73 

(1.84) 

1.10  (1.61) 

1.58  (1.67) 
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Table  3 

Mean  range-corrected  P300  amplitude 
Standard  Deviations  are  enclosed  in  parentheses 


VELOCITY  IN  ACCELERATION  IN  VELOCITY  IN  ACCELERATION  IN 
ONE  DIMENSION  .  ONE  DIMENSION  TWO  DIMENSIONS  TWO  DIMENSIONS 


SECONDARY 

69.59 

44.08 

53.02 

19.19 

TASK 

(39.13) 

(34.43) 

(38.53) 

(31.54) 

PRIMARY 

27.39 

53.17 

35.06 

76.65 

TASK 

(34.87) 

(35.58) 

(39.19) 

(32.18) 

Table  4 

Mean  range-corrected  P300  amplitude 
Combined  analysis 

Standard  Deviations  are  enclosed  in  parentheses 


VELOCITY  IN  ACCELERATION  IN  VELOCITY  IN  ACCELERATION  IN 
ONE  DIMENSION  ONE  DIMENSION  TWO  DIMENSIONS  TWO  DIMENSIONS 


SECONDARY 

72.44 

44.01 

55.47 

19.29 

TASK 

(34.50) 

(33.79) 

(39.18) 

(33.77) 

PRIMARY 

28.07 

57.37 

33.67 

77.80 

TASK 

(32.32) 

(35.56) 

(39.14) 

(30.98) 
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Table  5 

Mean  slope  and  intercept  values  Tor  individual  reciprocity  functions 


SUBJECT 

SLOPE 

INTERCEPT 

SUBJECT 

SLOPE 

INTERCEPT 

1 

0.03 

97.35 

21 

0.11 

90.02 

2 

0.16 

102.49 

22 

0.61 

78.47 

3 

0.43 

87.76 

23 

0.32 

95.09 

4 

0.11 

81.95 

24 

-1.32 

164.21 

5 

0.56 

62.72 

25 

-0.79 

94.91 

6 

-0.22 

126.22 

26 

0.22 

86.06 

7 

0.20 

75.72 

27 

-0.57 

109.80 

8 

0.39 

73.00 

28 

0.00 

108.03 

9 

0.41 

70.31 

29 

-0.45 

137.51 

10 

-1.10 

136.28 

30 

-0.28 

115.70 

11 

0.08 

86.48 

31 

0.11 

84.17 

12 

0.88 

61.95 

32 

-0.23 

92.97 

13 

-0.45 

116.39 

33 

0.06 

71.55 

14 

-0.21 

105.22 

34 

1.17 

39.32 

15 

-1.21 

124.00 

35 

0.24 

91.04 

16 

-0.41 

114.11 

36 

0.44 

84.86 

17 

-0.13 

73.38 

37 

-0.42 

109.49 

18 

0.08 

93.47 

38 

0.47 

99.51 

19 

0.94 

35.61 

39 

1.32 

23.11 

20 

0.24 

92.36 

40 

-0.23 

126.94 

Mean  slope  =  0.04 

Std.  error 

=  0.09 

Mean 

intercept  =  92.99 

Std.  error 

=  4.47 
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Figure  1  The  scalp  distribution  of  the  grand  average  ERPs  elicited  by 
secondary  task  target  tones  during  the  dual-task  tracking  conditions. 

Figure  2  Panel  A)  Grand  average  waveform  +/-  one  standard  deviation  unit 
for  secondary  task  ERPs.  Panel  B)  Component  loadings  for  the  first  five 
components  extracted  from  a  PCA  of  the  secondary  task  ERPS. 

Figure  l  The  scalp  distribution  of  the  grand  average  ERPs  elicited  by  the 
primary  task  step  changes  for  the  different  step  tracking  conditions. 

Figure  4  The  effect  of  increased  system  order  upon  the  primary  task 
parietal  waveforms  is  shown.  The  cross-hatched  areas  indicate  increased 
positivity  as  a  function  of  increased  system  order. 

Figure  5  Panel  A)  Grand  average  waveform  with  +/-  one  standard  deviation 
unit  for  the  primary  task  waveforms.  Panel  B)  Component  loadings  for  the 
first  4  components  extracted  from  a  PCA  of  the  primary  task  ERPs. 

Figure  6  Panel  A)  Grand  average  waveform  with  +/-  one  standard  deviation 
unit  for  the  combined  primary  and  secondary  task  ERPs.  Panel  8)  Component 
loadings  for  the  first  5  components  extracted  from  a  PCA  of  combined  primary 
and  secondary  task  ERPs. 

Figure  7  Mean  range  corrected  primary  and  secondary  task  P300  amplitudes 
for  all  of  the  dual-task  tracking  conditions  plotted  as  a  function  of  the 
associated  RMS  error  score.  The  reciprocity  function  represents  the  sum  of 
the  primary  and  secondary  task  amplitude  measures. 
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-  Abstract 

This  study  assesses  the  use  of  a  conditioning  paradigm  with  rabbits  as 
a  model  system  for  Identifying  neural  substrates  of  the  late  positive  (P300) 
component  of  the  human  event-related  potential.  Multi-unit  and 
macropotential  responses  In  five  limbic  system  structures  (the  anterior  and 
posterior  cingulate  cortices  (ACC  and  PCC],  the  dentate  gyrus  ( DG ) ,  and  the 
anterior  ventral  and  medial  dorsal  thalamic  nuclei  (AVN  and  MOM))  were 
recorded  during  the  acquisition  and  performance  of  locomotory  conditioned 
responses.  The  behavior,  performed  In  an  activity  wheel  In  response  to  a 
0.5  sec.  tone  conditional  stimulus  (CS+),  prevented  the  occurrence  of  a 
shock  unconditional  stimulus  (US)  scheduled  5  sec.  after  CS+  onset.  The 
rabbits  also  learned  to  Ignore  a  different  tone  (CS-),  not  predictive  of  the 
US.  Training  was  given  dally  (120  trials,  60  with  each  stimulus,  in  a 
random  order),  until  discrimination  between  the  CSs  reached  criterion.  The 
Intertrial  Interval  (ITI)  varied  from  5  to  25  sec.  In  a  random  sequence. 
After  criterion  (Experiment  1),  standard  asymwtrlc  probability  (AP) 
sessions  were  given  In  which  the  CS+/CS-  proportions  were  .2/. 8  or  .8/. 2. 

The  standard  AP  sessions  were  the  same  as  conditioning  sessions  except  for 
the  probability  manipulation.  In  addition,  nonstandard  AP  sessions 
Involving  a  brief  (1  sec)  ITI,  no  US,  and  behavioral  response  prevention,- 
were  presented  before  training  and  at  several  behavioral ly-deflned  stages  of 
acquisition.  A  significant  discriminative  response,  l.e.,  a  greater  unit 
discharge  to  the  CS+  than  to  the  CS-,  developed  In  all  regions  during  the 
course  of  behavioral  acquisition.  Generally,  the  magnitude  of  the  unit 
responses  elicited  by  rare  CSs  exceeded  that  elicited  by  frequent  CSs  and  by 
equlprobable  CSs.  Ourlng  the  standard  AP  sessions,  the  ACC  and  the  OG 
exhibited  these  effects  In  trained  rabbits  Independently  of  CS  relevance. 

The  minimal  PCC,  AVN  and  MON  responses  to  the  CS-  In  trained  rabbits  was 
Increased  In  the  standard  AP  sessions  when  the  CS-  was  presented  rarely,  but 
the  ample  response  to  the  CS+  In  these  structures  was  not  altered,  or  It  was 
reduced  by  rare  presentation.  Response  amplitudes  In  all  areas  were  reduced 
during  the  nonstandard  AP  sessions,  compared  to  standard  sessions,  but  In 
all  areas  the  responses  elicited  by  the  rare  CSs  exceeded  those  elicited  by 
the  equlprobable  and  frequent  CSs  In  these  sessions.  In  Exp  2,  standard  AP 
sessions  were  presented  to  trained  rabbits  with  lesions  of  the  sublcular 
complex  of  the  hippocampal  formation.  In  order  to  study  the  effects  of 
Interrupting  the  hippocampal  projection  to  the  PCC  and  AVN.  The  unit  and 
macropotential  discriminative  responses  In  the  AVN  were  enhanced,  and  the 
macropotential  responses  to  the  rare  CS+  In  the  PCC  and  AVN  were  enhanced  In 
rabbits  with  lesions,  relative  to  controls.  These  data  are  discussed  in 
relation  to  current  models  of  P300  and  limbic  system  functions,  and 
implications  for  the  localization  of  the  P300  In  humans  are  considered. 
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Event -related  potentials  (ERPs)  can  be  extracted  from  the  ongoing  EEG 
activlty-that  Is  recorded  from  the  scalp  In  human  subjects.  These 
potentials  constitute  that  portion  of  the  EEG  that  Is  time-locked  to  the 
eliciting  event  (6).  Considerable  evidence  has  accumulated  In  the  past  two 
decades  Indicating  that  specific  components  of  ERPs  can  be  recorded  reliably 
under  fairly  well-defined  experimental  circumstances.  Some  of  these 
components  have  been  studied  In  sufficient  detail  to  permit  development  of 
theories  about  the  functional  relevance  of  the  Intracranial  generators 
(12,14,16,30,41).  However,  there  Is  as  yet  little  empirical  Information 
about  the  nature  of  this  Intracranial  activity.  Its  sites  of  origin  or  Its 
functional  relevance.  We  report  here  an  Investigation  of  multi-unit 
activity  and  intracranial  macropotentials  recorded  In  limbic  system 
structures  In  awake,  behaving  rabbits  In  response  to  stimuli  that  varied  In 
terms  of  their  relative  frequency  of  presentation  (l.e.,  probability)  and 
behavioral  relevance.  The  results  suggest  that  the  activity  elicited  by 
these  stimuli  Is  analogous  to  the  P300,  one  of  the  well-defined  ERP 
components. 

The  P300  was  first  described  by  Sutton  (63).  It  appears  as  a  wave  of 
positive  polarity  with  a  latency  of  at  least  300  msec  In  response  to  events 
that.  In  Sutton's  phrase,  "resolved  the  subject's  uncertainty".  Subsequent 
work  has  shown  that  the  P300  amplitude  Is  sensitive  to  both  the  behavioral 
relevance  of  the  eliciting  stimuli  (l.e.,  whether  or  not  the  stimuli  call 
for  action  by  the  subject)  and  the  probability  of  the  stimuli.  The 
relationship  between  these  two  variables  remains  somewhat  controversial. 
However,  It  Is  clear  that  events  must  have  behavioral  relevance  if  they  are 
to  elicit  the  P300,  and  that  under  normal  circumstances  the  lower  the 
probability  of  the  event,  the  larger  will  be  the  P300  that  It  elicits. 

The  data  supporting  these  conclusions  have  been  reviewed  by  Pritchard 
(54).  The  same  data  also  form  the  basis  for  several  theoretical 
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Interpretations  of  the  P300.  Particularly  useful  In  subsuming  many  of  the 
empirical  findings  Is  the  theoretical  account-proposed  by'Donchln  et 
al.(12,14),  stating  that  the  P300  Is  a  manifestation  of  activity  Invoked 
when  there  is  a  need  to  update  templates  In  the  subject's  working  memory. 

Yet,  alternative  theories  of  P300  are  also  consistent  with  the  data 
(9,10,56,57),  and  there  Is  no  present  means  to  discriminate  decisively  among 
these  alternatives. 

The  reasons  for  this  are  both  technical  and  theoretical.  There  Is  some 
controversy  regarding  the  number  of  “components*  that  are  In  fact  active  at 
any  given  latency  (57).  Furthermore,  it  Is  possible  that  different 
constellations  of  Intracranial  activities  yield  similar  configurations  of 
,  activity  at  the  surface  of  the  skull  (69).  If  this  Is  true,  the  theory  or 
the  P300  will  remain  Indeterminate  unless  Information  about  Intracranial 
activity  Is  available.  Thus,  one  reason  to  Investigate  the  Intracranial 
origins  of  the  P300  has  to  do  with  the  measurement  of  this  potential:  such 
Information  can  Indicate  whether  the  P300  reflects  a  collection  of 
functionally  disparate  Informational  processes  In  the  brain,  or  a  process 
that  Is  unitary.  A  second  reason  to  carry  out  such  Investigations  Is  that 
knowedge  from  neuroscience  about  the  behavioral  functions  of  the  Involved 
neural  systems  can  facilitate,  and  may  decisively  establish,  the  functional 
Interpretation  of  the  P300. 

The  need  to  elucidate  the  neural  origins  of  such  components  as  the  P300 
1$  being  addressed  by  three  main  avenues  of  research.  When  clinical 
circumstances  have  permitted  It,  Investigators  have  replicated  ERP  results 
derived  from  scalp  recordings,  using  Intracranial  electrodes  In  humans 
(33,40,42,43,60,61,71,72,73,74).  In  addition,  several  groups  are  attempting 
to  record  the  magnetic  equivalent  of  the  human  P300  (34,48,55).  This 
research  will  become  Increasingly  Important  as  the  technology  for  the 
recording  of  magnetic  fields  develops,  thus  permitting  more  powerful 
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Inferences  about  Intracranial  sources.  The  use  of  animal  models  to  study  the 
Intracranial  origins  of  the  P300  Is  an  obvious  approach  to  the  problem, 
assuming  that  animals  exhibit  phenomena  analogous  to  the  P300  In  humans. 
Indeed,  studies  of  ERPs  In  a  variety  of  non-human  species  have  Indicated 
that  a  wave  very  similar  to  the  P300  can  be  recorded  with  eliciting 
conditions  similar  to  those  used  for  study  of  the  P300  In  humans 
(1,4,19,27,31,35,36.37,39,40,44.45,46,47,51,52,59,67)  A  particular 
advantage  of  this  approach  is  the  opportunity  for  localizing  the  recorded 
potentials.  Just  as  for  surface  ERPs,  the  Interpretation  of  Intracranial 
ERPs  can  be  confounded  by  voltage  signals  originating  at  loci  distant  from 
the  recording  site.  However,  this  problem  can  be  addressed  in  the  cerebral 
cortex  by  obtaining  laminar  ERP  profiles  that  permit  the  derivation  of  thd-*" 
signal  sources  (69).  Moreover,  the  sites  of  origin  of  single-  and 
multi-unit  activity  are  easily  determined  as  this  activity  originates  In  the 
immediate  vicinity  of  the  recording  probe  (3). 

In  the  present  paper  we  report  the  first  application  of  multi-unit 
recording  In  behaving  animals  to  the  problem  of  localizing  brain  activities 
analogous  to  the  P300.  The  data  were  obtained  from  rabbits,  while  they  were 
engaged  In  a  discriminative  avoidance  task.  In  this  task  the  rabbits  learn 
to  respond  when  they  hear  a  shock-predictive  tone  (CS+)  and  to  Ignore  a 
different  tone  (CS-),  not  predictive  of  shock.  Several  considerations  led 
us  to  choose  this  particular  model.  First,  past  work  has  demonstrated  that 
event-related  single-  and  multi-unit  activity  can  be  recorded  from  cortical 
and  thalamic  structures  nf  the  limbic  system  during  acquisition  and 
performance  In  this  paradigm.  This  activity  Is  similar  in  certain  respects 
to  the  P300  (27).  The  structures  of  Interest  were  the  anterior  and 
posterior  cingulate  cortices  (ACC  and  PCC),  the  medial  dorsal  and  anterior 
ventral  nuclei  of  the  thalamus  (the  MON  and  AVN),  and  certain  areas  of  the 
hippocampal  formation.  Of  particular  Interest  was  the  sensitivity  of  the 
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neurons  In  these  structures  to  variations  of  the  behavioral  relevance  of  the 
el Icltlng-  stlmul 1  (20,22,25,50).  That  Is,  distinctive  neuronal  discharges 
developed  In  response  to  CS+  presentations,  relative  to  the  responses  to  the 
CS-,  as  the  rabbits  apprehended  the  different  associative  significances  of 
these  stimuli.  Sensitivity  to  the  behavioral  relevance  of  eliciting  stimuli 
is,  as  noted  above,  a  hallmark  of  the  P300.  Furthermore,  the  theory  of  the 
limbic  system's  role  In  discriminative  conditioning  that  Is  based  on  the 
data  from  rabbits  (23,29)  Is  Intrlgulngly  compatible  with  the  mnemonic 
Interpretation  of  the  functional  significance  of  the  human  P300  (12,14). 

Our  past  work  has  demonstrated  robust  unit  sensitivity  In  the  limbic 
structures  to  the  behavioral  relevance  of  the  CSs.  In  the  first  experiment 
(Exp  1),  we  attempt  to  extend  the  analogy  between  the  unit  activity  and  the" 
the  P300  by  examining  the  degree  to  which  the  same  structures  exhibit 
sensitivity  to  the  manipulation  of  the  probability  of  the  stimuli. 

A  rather  substantial  body  of  evidence  supports  the  Idea  that  the 
hippocampal  formation  Is  Involved  Importantly  In  the  processing  of  rare 
and/or  behavlorally  relevant  stimuli  (8,29,32,53,64).  This  viewpoint,  as 
well  as  the  rather  striking  potentials  recorded  In  the  hippocampus  In  human 
patients  responding  to  rare  and  relevant  stimuli  (33,43)  suggest  that  the 
hippocampal  formation  may  be  Involved  In  the  generation  of  the  P300. 

Indeed,  It  Is  possible  that  the  hippocampus  may  be  a  primary  site  of  neural 
sensitivity  to  stimulus  relevance  and  probability,  and  that  the  exhibition 
of  this  sensitivity  In  nonhlppocampal  areas  may  be  owed  to  Influences 
projected  from  the  hippocampus.  Experiment  2  provides  data  relevant  to  this 
hypothesis.  In  the  form  of  macropotentlals  and  unit  activity  recorded  from 
the  PCC  and  AVN,  both  major  projection  targets  of  the  hippocampal  formation 
(65).  These  records  were  obtained  In  Intact  rabbits  and  In  rabbits 
previously  given  bilateral  electrolytic  lesions  In  the  sublcular  complex, 
the  region  of  the  hippocampal  formation  In  which  efferent  projections  to  the 
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PCC  and  AVN  originate  (65).  The  results  indicate  that  the  responses  to 
stimulus-re+evance  and  probability  In  the  PCC  and  AVN  are  not  hindered, 
indeed  certain  aspects  of  the  response  are  enhanced,  in  rabbits  with 
disrupted  hippocampal  afferents. 


METHODS 

Subjects,  Surgical  Procedures.  Electrodes  and  Target  Areas 

The  subjects  were  male.  New  Zealand  White  rabbits,  weighing  I. 5-2.0  kg 
at  the  time  of  their  delivery  to  the  laboratory.  Seventeen  and  29  rabbits 
served  as  subjects  In  Experiments  1  and  2,  respectively.  They  were 
maintained  on  ad  libitum  water  and  food  throughout  the  experiments. 

Following  a  minimum  period  of  48  hr,  surgical  anaesthesia  was  Induced  by 
Injection  of  12.5  mg/kg  of  chi or promazine  In  solution  (25  mg/ml),  followed 
after  ten  minutes  by  25.0  mg/kg  of  sodium  pentobarbital  In  solution  (50 
mg/ml).  Into  the  marginal  vein  of  the  pinna.  The  rabbits  were  placed  In  a 
headclamp  and  the  skull  was  exposed  and  prepared  for  stereotaxic 
Implantation  of  the  electrodes.  Supplemental  Injections  of  sodium 
pentobarbital  in  solution  (0.07  mg/kg)  were  given  at  0.5-hr  Intervals  to 
maintain  anesthesia. 

The  electrodes  were  made  from  stainless-steel  Insect  pins  insulated 
with  epoxyllte  (outside  diameter:  0.5  -  0.8  m).  Recording  surfaces  were 
formed  by  removing  Insulation  from  the  tip  (tip  lengths:  20-60  microns; 
impedance:  0.25  -  2  megohms).  Neuronal  activity  was  monitored  acoustically 
and  with  an  oscilloscope  during  electrode  lowering.  Two  stainless-steel 
machine-screw  (256  x  1/8)  electrodes  for  recording  epidural  EEC  were 
threaded  into  the  skull  flush  with  the  skull  undersurface.  A 
stainless-steel  screw  threaded  Into  the  frontal  sinus  served  as  the 
reference  electrode.  Once  lowered,  tne  electrodes  and  a  miniature  multipin 
connector  to  „h1ch  they  were  pre-soldered,  were  affixed  to  the  skull  with 
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dental  acry.lic. 

The  targets  of  the  Intracranial  electrodes  were:  the  anterior 
cingulate  cortex  (ACC:  8rodmann's  Area  24),  the  posterior  cingulate  cortex 
(PCC :  Areas  29b  and  29c),  the  dentate  gyrus  (DG),  the  medlodorsal  thalamic 
nucleus  (MON)  and  the  anteroventral  thalamic  nucleus  (AVN).  Four  of  these 
sites  were  chosen  for  each  rabbit. 

The  stereotaxic  coordinates,  based  on  the  atlas  of  Flfkova  and  Marsala 
(5),  were  AP  -4.0,  L  0.9-1. 5,  OV  2.0-3. 0  for  the  ACC;  AP  4.0,  L  0.8,  OV  1.0- 
2.0  and  AP  9.0,  L  2.0,  OV  4.0,  respectively,  for  the  dorsomedlal  and 
ventrolateral  PCC;  AP  6.0,  L  6.5,  DV  3. 5-4.0  for  the  DG;  AP  4.5,  L  1.2,  DV 
8.5  for  the  MON;  and  AP  1.8,  L  2.3,  OV  7. 5-8.0  for  the  AVN.  The  data  for 
the  two  PCC  subfields  were  combined.  The  surface  electrodes  were  placed  I-1" 
hr  from  the  mldllne,  5. 0-6.0  mm  anterior  to  bregma,  overlying  Area  8,  or 
5. 0-9.0  mm  posterior  to  bregma,  overlying  Area  29d.  These  sites  are 
referred  to  respectively  as  the  anterior  surface  (AS)  and  the  posterior 
surface  (PS). 

Procedures  of  Behavioral  Training. 

After  a  minimum  of  seven  days  following  surgery,  each  rabbit  received 
avoidance  conditioning  In  an  activity  wheel,  a  replica  of  the  apparatus  of 
Brogden  and  Culler  (2).  The  wheel  was  located  In  a  shielding  chamber  housed 
In  a  room  adjacent  to  the  one  containing  the  computer  that  controlled  data 
collection.  An  exhaust  fan  and  a  speaker  produced  a  masking  noise  of  70  dB 
re  20  N/m.  The  conditional  stimuli  were  pure  tones  (1  or  8  kHz,  85  d8  re  20 
N/m,  500  ms  In  duration,  rise  time-3  ms)  played  through  a  speaker  directly 
above  the  wheel.  The  unconditional  stimulus  (US)  was  a  constant  AC  current 
(1.5-2. 5  ma)  delivered  through  the  grid  floor  of  the  wheel.  A  response  was 
defined  as  any  wheel  rotation  exceeding  2  degrees.  The  US  current  was 
established  for  each  rabbit  at  the  outset  of  conditioning,  as  the  minimum 
value  needed  to  reliably  elicit  responses. 
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Ourlng  training,  onset  of  the  positive  conditional  stimulus  (CS+)  was 
followed~5  -seconds  later  by  US  onset.  The  CS+  and  US  were  terminated  by 
responses.  A  response  after  CS+  onset  and  prior  to  US  onset  prevented  the 
US.  The  maximum  duration  of  the  US,  given  failure  of  response,  was  one 
second.  Presentations  of  the  negative  conditional  stimulus  (CS-;  at  the 
frequency  (1  or  8  kHz ]  not  used  for  the  CS+)  were  randomly  Interspersed  with 
the  CS+  presentations.  The  CS-  was  also  response-terminated,  but  it  was  not 
followed  by  the  US.  The  Interval  between  the  end  of  a  trial  (defined  as 
offset  of  the  CS  or  of  wheel  rotation  when  locomotion  occurred)  and  the 
onset  of  a  new  trial  (CS  onset)  was  10,  15,  20  or  25  sec.  These  values 
occurred  In  a  random  sequence.  Responses  during  the  Interval  reset  It. 

The  sequence  of  CS+  and  CS-  for  a  given  training  session  was  one  of  ^ 
three  pseudorandom  sequences  designed  to  minimize  the  subject's  ability  to 
predict  the  next  CS,  and  to  minimize  the  likelihood  that  the  subject  would 
be  Influenced  by  higher-ordered  sequential  conditional  probabilities.  The 
sequences  were  selected  on  a  rotating  basis  throughout  training  for  a  given 
subject. 

Each  subject  received  training  (120  trials  dally,  60  with  the  CS+  and 
60  with  the  CS-)  until  the  proportion  of  avoidance  responses  (l.e., 
responses  to  the  CS+  before  US  onset)  exceeded  the  proportion  of  responses 
to  the  CS-  by  .60  or  more  In  any  60-trlal  block.  This  performance  had  to 
occur  In  two  consecutive  sessions.  This  criterion  yields  an  asymptote  of 
discriminative  behavior  not  exceeded  with  further  training. 

Pretralnlnq.  Before  conditioning,  two  pretraining  (PT)  sessions  were 
given.  In  the  first,  the  tones  later  used  as  CSs  were  presented  alone,  and 
In  the  second,  the  tones  and  the  US  were  presented  In  a  noncontlgent 
(explicitly  unpaired)  fashion.  The  frequency  and  temporal  distribution  of 
the  US  during  PT  were  Identical  to  the  average  values  obtained  during  the 
first  session  of  conditioning  in  a  sample  of  100  rabbits.  The  only 
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deviation  from  this  procedure  was  that  in  PT,  the  US  was  never  presented  in 
the  Interval  from  three  seconds  before  to  three  seconds  after  a  CS.  The  PT 
sessions  served  to  provide  a  baseline  of  neuronal  and  behavioral  responses 
to  the  CSs,  for  comparison  with  the  responses  that  developed  during 
training. 

Standard  Asymmetric  Probabi 1 i ty  Sessions.  Following  criterion 
attainment,  the  rabbits  received  two  standard  asymmetric  probability  (AP) 
sessions.  In  the  first  of  these,  ten  of  the  rabbits  received  the  CS+  and 
CS-  In  .2  and  .8  of  the  trials,  respectively.  These  proportions  were 
interchanged  In  the  second  session.  Seven  additional  rabbits  received  these 
treatments  In  the  opposite  order.  Following  the  two  standard  AP  sessions, 
all  of  these  rabbits  received  a  standard  training  session  in  which  the  CSs 
were  presented  equally  often. 

Nonstandard  AP  Sessions.  These  sessions  Involved  200  CS  presentations 
with  a  brief  (1-sec.)  Interstimulus  Interval,  the  prevention  of  wheel 
movement,  and  no  US.  None  of  the  rabbits  attempted  to  locomote  In  the  wheel 
after  the  first  few  trials.  This  procedure  was  designed  to  approximate  the 
"oddball"  paradigm  used  In  studies  of  ERPs  with  human  subjects  (13). 

The  nonstandard  AP  sessions  were  given  2-5  min.  after  the  end  of  the  PT 
sessions  and  after  training  sessions  representing  certain  landmarks  of 
behavioral  acquisition.  These  were  the  first  session  of  conditioning,  the 
session  in  which  a  significant  behavioral  discrimination  first  occurred,  and 
the  second  of  the  two  sessions  In  which  the  acquisition  criterion  was  met. 
The  session  In  which  the  first  significant  behavioral  discrimination 
occurred  was  defined  as  the  session  In  which  the  proportion  of  avoidance 
responses  to  the  CS+  exceeded  that  to  the  CS-  by  .25  or  more.  This  value 
approximates  the  minimum  required  to  produce  a  significant  chi-square 
(£■•05)  for  a  difference  between  correlated  proportions  (66). 
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A  subset  of  the  rabbits  (N-9)  received  only  one  nonstandard  AP  session 
at  each 'training  stage.  In  these  sessions  the  only  asymmetric  probability 
treatment  was  the  rare  presentation  (l.e..  In  .2  of  the  trials)  of  the  CS+. 
The  remaining  rabbits  (N»8)  received  two  nonstandard  AP  sessions,  one  with 
rare  and  the  other  with  frequent  CS+  presentations  In  a  counterbalanced 
order.  The  duration  of  the  nonstandard  AP  sessions  was  about  10  min. 

Sublcular  lesions.  The  rabbits  with  sublcular  lesions  (N-10)  and 
controls  (N.*18)  received  standard  conditioning  and  overtraining  as  described 
above,  but  nonstandard  AP  sessions  were  not  given  to  these  rabbits.  The 
data  reported  here  were  recorded  during  the  third  postcrlterlal  overtraining 
session  and  during  a  single  standard  AP  session  with  rare  CS+  presentation 
(CS+/CS-  proportions  «  .2/ .8).  The  lesions  were  made  at  two  sublcular  loci, 
one  anteromedial  and  another  posterolateral,  along  the  septotemporal  axis  of 
the  hippocampal  formation.  The  coordinates  (anteromedial  site:  P  6.00,  L 
2.50,  V  5.00;  posterolateral  site:  P  7.50,  L  4.25,  V  4.00)  were  derived  from 
the  atlas  of  Flfkova  and  Marsala  (5).  The  leslonlng  electrodes  were  made 
with  stainless-steel  Insect  pins  coated  with  epoxyllte.  The  Insulation  was 
removed  0.50-0.75  mi.  from  the  pointed  ends  of  the  pins.  A  current  of  1.5 
ma  was  delivered  at  each  site  for  30  or  45  sec. 

Recording  and  Analysis  of  Neural  Activity. 

Throughout  training,  the  neural  records  were  fed  Into  a  field  effect 
transistor  (FET)  which  served  as  a  high  Impedance  source-follower  located 
about  2.5  cm  from  the  recording  site.  The  FET  outputs  were  split,  one  limb 
entering  single-ended  preamplifiers  with  gain  (4000)  and  bandwidth  (1/2 
amplitude  cutoffs  at  0.1  and  20  Hz)  suitable  for  ERP  signals.  The  other 
limb  entered  preampl If lers  with  gain  (10,000)  and  bandwidth  (1/2  amplitude 
cutoffs  at  600-10,000  Hz)  suitable  for  recording  unit  action  potentials. 

The  outputs  of  the  unit  preamplifiers  were  fed  through  active  bandpass 
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filters  (i/2  amplitude  cutoffs  at  600  and  10,000  hz,  rolloff »18  db/octave) 
to  remove  any  residual  slow  potential  activity.  Outputs  of  the  band-pass 
filters  were  fed  in  parallel  Into  Schmitt  triggers  and  Into  a  circuit  for 
full-wave  rectification  and  RC  Integration  (3).  The  Schmitt  trigger  outputs 
measured  the  firing  frequency  of  the  largest  three  or  four  spikes  on  each 
record,  whereas  the  Integrated  activity  measured  energy  fluctuations  of  the 
entire  record.  Including  activity  below  the  triggering  thresholds.  The  rise 
and  fall  time  constants  of  the  Integrators  were  15  and  75  ms,  respectively. 

Each  Schmitt  trigger  produced  a  digital  pulse  each  time  a  unit  spike 
exceeded  the  preset  voltage.  The  output  pulses  were  fed  Into  an  LSI  11/23 
computer  programmed  to  process  the  neural  data  and  to  control  the  behavioral 
experiment,  on-line.  The  triggering  levels  were  set  Independently,  under ^ 
computer  control,  such  that  the  mean  rate  of  pulses  fell  within  limits  of  ' 
110-190  per  second.  Preceding  each  trial,  the  trigger  setting  was  adjusted 
upward  or  downward  automatically  to  compensate  for  occasional  shifts  In 
baseline  activity. 

The  macropotentlal  and  unit  records  were  digitized  and  the  Schmitt 
trigger  pulses  were  counted  for  each  10  ms  Interval,  for  a  total  duration  of 
1.0  second,  300  ms  before  and  700  ms  after  CS  onset. 

Several  procedures  were  used  to  ensure  that  neural  and  non-neural 
(e.g.,  electromyographic)  electrical  activity  resulting  from  movement  did 
not  Influence  the  data.  First,  the  mean  latency  of  the  avoidance  response 
over  all  training  sessions  was  greater  than  three  seconds,  but  the  analysis 
Involved  only  neuronal  results  In  the  Initial  400  ms  after  CS  onset. 
Infrequently,  responses  occurred  during  the  first  500  ms  after  CS  onset. 

Such  trials  were  discarded  and  repeated.  When  non-locomotor  movements 
(e.g.,  licking,  sneezing,  grooming)  occurred  during  the  300  ms  pre-CS  period 
or  In  the  Initial  400  ms  period  following  CS  onset,  the  artifact  generated 
was  detected  by  a  Schmitt  trigger,  resulting  In  automatic  rejection  and 
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repetition  Qf.  the  trial.  In  addition,  the  criterion  for  setting  trigger 
levels  relative  to  the  neuronal  records  resulted  in  a  constant  baseline  rate 
of  Schmitt  trigger  spikes  In  the  range  of  110-190  per  second.  With  this 
criterion,  the  expected  number  of  counts  per  Interval  In  the  quiet  rabbit 
was  about  1.5.  Neuronal  bursts  produced  routinely  counts  of  5  or  6,  but 
counts  of  8  or  more  were  rare  unless  artifacts  occurred.  Trials  In  which  six 
or  more  of  the  120  pre-CS  10-ms  Intervals  (30  Intervals  from  each  of  the 
four  channels)  had  counts  of  8  or  more  were  automatically  discarded  and 
repeated.  Finally,  the  experimenter  rejected  trials  with  a  single  keyboard 
command  If  any  sound  of  movement  was  detected  on  the  audio  monitor 
immediately  before  or  after  CS  onset.  Previously,  electrical  effects  from 
-several  electrodes  located  Inadvertently  In  nonneuronal  regions  of  the  — - 

Intracranial  cavity  (e.g.,  the  mldllne,  the  cerebral  aqueduct)  were 
analyzed.  Significant  and  systematic  ■response"  activity  to  the  tone 
stimuli  for  records  of  this  kind  was  not  found  In  any  stage  of  conditioning. 

The  activity  sampled  on  each  trial  was  stored  on  digital  magnetic  tape 
for  subsequent  processing.  Histograms  Indicating  the  average  CS+  and  CS- 
ellclted  firing  frequencies.  Integrated  activity,  and  macropotentlals  were 
displayed  continuously  on  a  VT100  graphics  terminal  as  they  were  being 
formed  during  the  sessions.  The  offline  analysis  of  the  taped  data  Involved 
the  computation  of  standard  scores  (z-scores)  for  each  of  the  70  10-ms 
Intervals  after  CS  onset.  Each  score  was  obtained  by  subtracting  the  mean 
of  the  30  pre-CS  Intervals  from  the  value  In  each  of  the  70  post-CS 
Intervals  and  dividing  the  difference  by  the  standard  deviation  of  the 
pre-CS  Intervals. 

Because  the  number  of  sessions  required  to  attain  the  behavioral 
criterion  varied  among  the  subjects,  the  statistical  analysis  focused  on 
three  behavioral ly-deflned  stages  of  training  coaaon  to  all  subjects:  a)  A 
pretraining  stage  Including  the  combined  data  of  the  two  pretraining 
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sessions;  b)  An  Intermediate  stage.  Including  the  first  session  of  training 
and  the -session  of  the  first  significant  behavioral  discrimination;  and  c) 

An  asymptotic  training  stage.  Including  the  data  obtained  during  the  two 
criteria!  sessions  and  the  single  postcrlterlal  session  with  symmetric  CS 
probability.  The  neural  data  of  the  separate  sessions  within  each  of  these 
stages  were  pooled  by  averaging  the  2- scores.  Oata  from  the  nonstandard  AP 
sessions  that  were  given  Inwedlately  after  the  standard  training  sessions 
were  also  pooled  In  this  way. 

Oata  from  the  standard  training  sessions  were  submitted  to  analysis  of 
variance,  with  orthogonal  factors  of  stage  (3  levels:  pretraining, 
intermediate  training,  and  asymptotic  training),  CS  type  (two  levels:  CS+ 
and  CS-)  and  Interval  after  CS  onset  (40  levels:  each  10-ms  Interval).  Data 
from  the  nonstandard  AP  sessions  were  submitted  to  analysis  of  variance, 
with  orthogonal  factors  of  CS  type  (two  levels:  CS+  and  CS-)  and  Interval 
after  CS  onset  (40  levels:  each  10-m$  Interval).  Separate  analyses  were 
performed  for  the  data  at  each  training  stage.  The  data  from  the  single 
overtraining  session  and  the  two  standard  AP  sessions  were  submitted  to  an 
analysis  of  variance  with  orthogonal  factors  of  CS  type,  interval  after  CS 
onset  and  session  (3  levels:  rare,  frequent  and  equlprobable  CS+). 

Given  a  significant  overall  F,  Individual  comparisons  of  mean  response 
magnitudes  were  carried  out  (£<.05)  using  Fisher's  protected  least 
significant  difference  test  (68). 

Histological  Identification  of  Recording  Sites  and  Lesions. 

After  testing,  each  rabbit  received  an  overdose  of  sodium 
pentobarbital  followed  by  transcardlal  perfusion  with  normal  saline  and  10% 
formalin.  Each  brain  was  frozen  and  sectioned  at  40  microns.  The  sections 
containing  the  electrode  tracks  and  lesions  were  photographed  while  still 
wet  (21).  After  photography,  the  sections  were  stained  for  Nlssl  and  myelin 
using  formol-thlonln  (17). 


ANIMAL  MOOEL  FOR  P300 


15 


The  numbers  of  unit  and  macropotentlal  records  used  In  the  analyses  are 
shown  Irr  Table  1.  These  numbers  represent  the  final  results  of  the 
histological  analysis  and  additional  screening  which  resulted  In  the 
exclusion  of  macropotentlal  and/or  unit  records  In  certain  sessions  due  to 
poor  quality,  technical  problems,  and.  In  one  case,  failure  to  meet  the 
behavioral  criterion  of  learning.  The  fact  that  subsets  of  the  rabbits  were 
assigned  to  limited  sets  of  the  behavioral  treatments,  as  detailed  in  the 
description  of  the  training  procedures,  also  Influenced  the  numbers  of 
records  In  the  analyses. 

The  dorsal  sublcular  complex  lesions  are  Illustrated  In  Figure  1. 
Additional  description  of  the  lesions,  and  data  yielded  by  the  rabbits  with 
lesions  during  standard  training  sessions,  are  reported  elsewhere  (26,28) 

RESULTS 


A.  OVERVIEW 

In  all  recording  sites,  rare  presentation  of  the  CS-  increased  the 
magnitude  of  the  unit  response  In  trained  rabbits,  relative  to  the  responses 
that  occurred  when  the  CS-  was  presented  more  frequently  than,  or  as 
frequently  as,  the  CS+  (Figure  2).  In  addition.  In  the  ACC,  rare 
presentation  of  the  CS+  increased  the  magnitude  of  the  unit  response 
relative  to  the  other  conditions  of  CS  presentation  (Figure  3).  The 
response  Increments  to  the  rare  stimuli  were  present  at  latencies  as  brief 
as  70  ms  after  CS  onset,  attaining  peak  amplitudes  from  90-300  ms  after  CS 
onset,  depending  on  the  recording  site  (Figure  7). 

The  foregoing  results  were  obtained  from  trained  rabbits  during  the 
standard  AP  sessions  that  were  identical  In  all  respects  to  standard 
avoidance  conditioning  sessions  except  for  the  CS  probability  manipulation. 
A  somewhat  different  pattern  of  results  was  obtained  during  the  nonstandard 
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AP  sessions  that  were  Intended  to  approximate  the  "oddball"  paradigm  found 
effective  In  eliciting  P300  In  humans.  These  sesstons  Involved  high  rate 
(1/sec)  CS  presentations  with  no  US,  and  prevention  of  behavioral  responses. 
The  results  obtained  In  these  sessions  Indicated  that  In  trained  rabbits,  a 
greater  average  unit  discharge  occurred  In  response  to  the  the  rare  CS+  than 
to  the  frequent  CS-.  In  all  but  one  site  (the  AVN),  the  rare  CS-  and  the 
frequent  CS+  elicited  statistically  equivalent,  minimal  average  unit 
responses  (Figure  4).  The  AVN  showed  a  greater  response  to  the  CS+,  even 
when  It  was  presented  frequently  during  these  sessions.  Finally,  the  unit 
responses  of  both  of  the  thalamic  nuclei  exhibited  a  greater  discharge  to 
the  rare  than  to  the  frequent  tones,  during  nonstandard  AP  sessions  given 
before  conditioning.  T* 

With  some  exceptions,  results  similar  to  those  just  described  for  the 
neuronal  activity  measure  were  also  obtained  for  the  Intracranial  and 
epidural  macropotentlals.  These  data  support  the  conclusion  that  the 
neuronal  activity  In  all  of  the  monitored  limbic  localities  exhibits  both  of 
the  response  features,  sensitivity  to  stimulus  relevance  and  probability, 
that  characterize  the  P300. 

Lesions  of  the  sublcular  coaqjlex  of  the  hippocampal  formation  enhanced 
the  discriminative  unit  response  (1.e.f  the  "target  effect")  recorded  from 
the  AVN,  and  they  attenuated  the  response  In  the  PCC.  However,  these 
lesions  did  not  obviously  alter  the  responses  exhibited  In  the  PCC  and  AVN 
during  the  manipulation  of  CS  probability.  This  outcome  Is  In  conformity 
with  other  data  Indicating  that  extracranial  macropotentlals  exhibit  normal 
or  enhanced  sensitivity  to  rare  stimuli  In  monkeys  or  humans  with  damaged 
hippocampi  (38,39,51,72).  Thus,  the  Integrity  of  the  hippocampal  formation 
does  not  appear  to  be  required  for  the  exhibition  of  this  sensitivity. 
Indeed,  the  hippocampal  projections  appear  from  the  present  data,  and  from 
other  results  (26,28),  to  be  Involved  In  1  lilting  the  relevance-  and 


ANIMAL  MOOEL  FOR  P300 


17 


probability-driven  activities  In  these  regions. 

Si  oetailed  results  of  exp  u  manipulation  of  stimulus  relevance  ano 

PROBABILITY 

1.  Behavior 

Behavioral  acquisition  replicated  that  observed  In  past  studies 
(20,29).  The  mean  number  of  training  sessions  prior  to  criterion  was  4.6. 
During  the  final  training  stage  (consisting  of  the  combined  data  from  the 
crlterlal  and  final  overtraining  sessions,  in  which  asymptotic  performance 
occurred),  the  rabbits  avoided  the  US  on  an  average  of  77*  of  the  trials, 
and  they  responded  to  the  CS-  on  fewer  than  9*  of  the  trials.  There  were  no 
significant  effects  of  the  probability  manipulations  on  performance  during 
overtraining. 

2.  Neural  Changes  During  Acquisition 

Overview.  As  In  past  studies,  the  unit  responses  to  the  CSs  Increased 
In  magnitude  and  became  discriminative  during  conditioning.  These  effects 
In  the  ACC,  PCC,  MDN  and  AVN  represent  replication  of  past  findings  with 
this  paradigm.  The  changes  In  the  0G  are  reported  here  for  the  first  time. 
The  amplitudes  of  certain  components  of  the  surface  and  Intracranial 
macropotentlals  Increased  during  training,  but  target  effects  developed  only 
minimally  In  the  macropotentlals,  and  primarily  In  the  epidural  recording 
sites. 

Unit  activity.  The  firing  frequency  and  Integrated  unit  activity 
yielded  qualitatively  similar  outcomes  In  this  study  but  the  Integrated 
activity  measure  was  less  variable  and  more  sensitive  to  the  experimental 
manipulations  than  the  spike  frequency  measure.  Thus,  only  the  results 
obtained  with  the  Integrated  unit  activity  measure  are  presented.  The  full 
set  of  data  are  available  upon  request  (62). 

As  In  past  studies,  the  histogram  profiles  of  the  PCC  and  AVN  were 
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triphasic  Inform,  consisting  of  peak  excitatory  discharges  at  20-40  and  80- 
250  ms,  -and-an  Inhibitory  pause  at  40-100  ms  after  CS  onset  (Figure  5,  rows 
2  and  5).  Also,  as  before,  the  profiles  of  the  ACC  and  the  MON  were 
essentially  blphaslc,  exhibiting  a  rising  excitatory  discharge  that  reached 
a  peak  value  at  approximately  100-150  ms.  This  level  was  maintained  In  the 
MON  throughout  the  remainder  of  the  sampling  Interval  (Figure  5,  row  4), 
whereas  the  activity  In  the  ACC  declined  gradually  after  150  ms  (Figure  5, 
row  1).  An  excitatory  discharge  with  a  200-300  ms  peak  and  subsequent, 
gradual  decline  was  recorded  In  the  06,  an  area  not  previously  studied  In 
this  paradigm  (Figure  5,  row  3). 

The  analyses  Indicated  that  a  discriminative  unit  response  developed 
during  training  In  all  recording  sites.  That  Is,  the  average  response  to-— 
the  CS+  exceeded  the  response  to  the  CS-  for  the  data  of  the  asymptotic 
training  stage,  but  no  such  difference  occurred  during  pretraining  (Figure 
5).  This  conclusion  Is  based  on  the  results  of  Individual  comparisons  of 
the  means  comprising  the  Interaction  of  the  training  stage,  stimulus  and 
Interval  factors.  The  Individual  comparisons  Indicated  that  In  each  of  four 
sites  (PCC,  06,  MON,  AVN),  the  CS+  elicited  significantly  greater  average 
unit  responses  than  the  CS-  In  a  majority  of  the  10-ms  poststimulus 
Intervals  from  100-400  ms  after  CS  onset  during  the  asymptotic  training 
stage,  but  not  during  the  pretraining  stage  (Table  2).  In  addition,  the 
magnitude  of  the  average  CS+  elicited  response  in  the  asymptotic  stage 
exceeded  the  response  recorded  during  pretraining  at  several  Intervals  in 
all  of  these  areas  (Table  2). 

The  training-related  development  of  Increased  unit  response  magnitudes 
during  conditioning  was  Indicated  for  the  ACC  by  significant  two-way 
Interactions  of  stimulus  with  Interval,  and  stage  with  Interval 
(F( 39/429 1-3.36;  £<.0001;  F l 78/858 1 - 1 . 60 ,  £-.0011,  respectively).  The 
Individual  comparisons  based  on  the  first  of  these  Interactions  showed  a 
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significantly  greater  discharge  to  the  CS+  than  to  the  CS-  at  240-270  ms  for 
the  prof-lie*  pooled  across  all  training  stages.  Since  there  was  no 
interaction  of  stimulus  and  training  stage,  these  results  do  not  provide  a 
direct  demonstration  of  development  of  a  discriminative  response  during 
training.  However,  the  occurrence  of  a  significant  overall  discriminative 
response,  and  the  absence  of  any  discriminative  effect  during  pretraining 
{Figure  5,  upper  left  panel)  support  the  conclusion  that  differentiation  of 
CS+  and  CS-  developed  with  training.  In  replication  of  several  past  reports. 
The  comparisons  based  on  the  Interaction  of  training  stage  and  interval 
demonstrated  significant  overall  (nondlscrlmlnatlve)  Increases  In  unit 
activity  In  virtually  all  poststimulus  Intervals  after  100  ms  during  the 
asymptotic  training  stage,  relative  to  pretraining.  — 

Macro potential  prof  lies.  The  waveshapes  of  the  average  Intracranial 
macropotentials  varied  with  the  recording  site,  but  certain  shared  features 
were  also  exhibited.  Thus,  In  all  but  the  PCC,  the  Intracranial  profiles 
showed  a  wave  of  negative  polarity  with  a  peak  value  at  100-200  ms  after  CS 
onset.  Within  this  same  latency  range  the  surface  recordings  exhibited  a 
large  positive  deflection  (Figure  6).  In  addition,  all  but  the  PCC  profile 
exhibited  one  or  two  positive  deflections  from  150-400  ms,  latencies 
dominated  by  negativity  In  the  surface  macropotentlal  records.  Finally,  a 
majority  of  the  Intracranial  records  exhibited  rather  sharp  positive 
potentials  at  brief  latency  (  30-70  ms),  but  small  negative  deflections  were 
observed  in  the  posterior  surface  recordings  within  this  Interval.  Thus, 
the  profiles  of  the  surface  and  Intracranial  potentials  were  approximate 
mirror  Images.  The  only  exception,  the  PCC  profiles,  exhibited  a  negative 
potential  after  150  ms,  and  a  sharp  positive  potential  at  75  ms.  In  advance 
of  the  Initial  positive  potentials  In  the  other  sites. 

The  overall  polarity  opposition  between  the  surface  and  Intracranial 
profiles  Is  In  accord  with  the  traditional  assumption  that  surface  records 
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are  driven  primarily  by  voltages  that  are  the  distal  sources  of  Intracranial 
voltage -sinks,  the  primary  sites  of  neural  activation. 

Macropotentlal  ampl itude  changes  during  acquisition.  The  amplitudes  of 
the  intracranial  and  surface  macropotentials  Increased  during  the  course  of 
behavioral  acgulsltlon,  as  indicated  by  comparison  of  the  profiles  obtained 
during  pretraining  (Figure  6,  left  column),  with  those  exhibited  during  the 
asymptotic  training  stage  (Figure  6,  right  column).  This  conclusion  Is 
based  on  marginally  significant  main  effects  of  training  stage  for  the  ACC 
(0-.O66I),  OG  (£*.0413)  and  MDN  (£*.0582).  The  Increased  amplitudes  were 
indicated,  in  the  case  of  the  AS,  PS,  DG  and  PCC,  by  the  occurrence  of 
significant  Interactions  of  the  stage  and  Interval,  and  of  the  stage. 

Interval  and  stimulus  factors  (Table  3).  Individual  comparisons  Indicated— 
that  the  AS  and  PS  positive  deflections  from  75-200  ms,  and  the  negative 
deflections  from  300  to  400  ms,  Increased  In  the  asy«a>tot1c  training  stage 
relative  to  the  pretraining  stage.  Similar  findings  were  obtained  for  the 
PCC.  However,  in  this  Instance,  the  peak  of  the  sharp  positive  deflection 
occurred  at  a  briefer  latency  than  In  the  surface  records. 

The  principal  amplitude  Increase  In  the  DG  occurred  In  relation  to  the 
profile  elicited  by  the  CS+,  which  was  significantly  reduced  relative  to  the 
CS-  profile  during  pretraining,  but  which  underwent  an  enlargement  during 
the  asymptotic  training  stage.  These  changes  occurred  from  80-170  ms  after 
CS  onset  (Figure  6,  row  5,  and  Table  3). 

Macropotentlal  discriminative  responses  during  acquisition.  The 
Interactions  of  the  training  stage  and  stimulus  factors  by  Interval 
Indicated  that  discriminative  macropotentlal s  developed  during  training,  in 
the  AS,  PS,  PCC  and  06  (Table  3).  Ourlng  pretraining,  the  anterior  surface 
(AS)  macropotentlal  profile  elicited  by  the  CS+  exhibited  no  differences 
relative  to  the  CS-  elicited  profile.  In  the  case  of  the  PS  profile,  and 
for  the  PCC,  the  CS-  elicited  more  ample  potentials  than  did  the  CS+. 
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Ourlng  the  asymptotic  training  stage,  however,  the  amplitudes  of  the  AS  and 
PS  posiMve-potentlals  elicited  by  the  CS-*-  exceeded  significantly  those 
elicited  by  the  CS-.  The  discriminative  responses  in  these  Instances 
occurred  from  60-200  ms  for  the  AS,  and  from  110-180  ms  after  CS  onset  for 
PS  (Figure  6,  rows  1  and  2).  In  the  PCC,  the  potential  elicited  by  the  CS+ 
exceeded  that  elicited  by  the  CS-  In  the  brief-latency  positive  wave  (from 
60-110  ms),  and  at  greater  latencies  ^160-360  ms.  Figure  6,  row  3). 
Discriminative  macropotentlal  development  In  the  D6  occurred  In  the  form  of 
an  Increase  during  asymptotic  training  In  the  potential  elicited  by  the  CS-*-. 
This  potential  had  been  significantly  reduced  relative  to  that  elicited  by 
the  CS-  during  pretraining  (80-170  ms.  Figure  6,  row  5).  However,  the 
potential  elicited  by  the  CS-*-  exceeded  that  elicited  by  the  CS-  In  the  — 

trained  rabbits  (200-210  ms.  Figure  6,  right  column,  row  5). 

Discriminative  macropotentlal  development  In  the  AVN  (Figure  6,  row  6) 
was  suggested  by  a  significant  Interaction  of  the  stimulus  and  Interval 
factors  (F^( 39/1561 -1.87,  £-.0040).  However,  the  Individual  comparisons  were 
not  significant. 

In  summary,  robust  Increases  In  macropotentlal  amplitude  occurred 
during  the  asymptotic  training  stage  In  all  of  the  sites  from  which 
recordings  were  made,  excepting  the  AVN.  Even  In  the  case  of  the  AVN, 
however,  the  nonsignificant  differences  suggested  a  training- Induced 
Increase  In  macropotentlal  amplitude,  a  result  that  has  occurred  In  past 
studies  (28)  and  that  would  have  been  significant  In  this  experiment  given  a 
larger  sample  of  AVN  records. 

In  contrast  to  the  robust  amplitude  increases  In  all  of  the 
macropotentlal  records,  and  In  contrast  to  the  development  of  discriminative 
unit  responses  In  all  of  the  areas,  discriminative  macropotentlal  responses 
developed  only  In  the  epidural  macropotentlal  records,  and  In  the  PCC  and 
OG.  Large  macropotentlal  amplitude  Increases  and  minimal  or  nonexistent 
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discriminative  macropotentlal  effects  have  been  replicated  for  ACC,  PCC  and 
AVN  since-  the  present  observations  were  made  (28). 

3.  Manipulation  of  CS  probabil Ity  during  overtraining:  The  standard  AP 
sessions. 

Prologue.  Each  rabbit  received  *wo  AP  treatments,  during  the  first  and 
second  postcrlterlal  (overtraining)  sessions.  In  which  the  proportions  of 
CS+  to  CS-  presentation  were  either  .2/. 8,  or  .8/. 2.  The  third  overtraining 
session.  In  which  the  stimuli  were  presented  with  egual  freguency,  provided 
data  for  comparison  with  the  two  standard  AP  sessions.  The  order  of  the  two 
AP  sessions  was  counterbalanced.  With  this  arrangement,  the  effects  of  the 
probability  manipulation  were  assessed  by  the  Interaction  of  sessions  and 
stimuli. 

Unit  responses  to  the  CS-.  An  enhanced  unit  response  to  the  rare  CS- 
was  Indicated  by  the  significant  Interactions  of  the  session  and  stimulus 
factors.  The  three-way  Interaction  of  these  factors  with  the  Interval 
factor  was  significant  In  the  analyses  of  ACC,  DG  and  AVN  unit  activity. 
Individual  comparisons  revealed  a  greater  neuronal  response  to  the  CS-  when 
It  was  presented  rarely  than  when  It  was  presented  frequently  and  equally  as 
often  as  the  CS+.  This  occurred  In  a  majority  of  10  ms  Intervals  from  90  to 
400  ms  after  CS  onset  (Figure  7,  Table  4). 

The  Increased  unit  response  of  the  PCC  to  the  rarely  presented  CS-, 
relative  to  the  sym*tr1cal1y  and  frequently  presented  CS-  (Figure  7,  right 
panel,  second  row)  was  indicated  by  a  significant  Interaction  of  the  session 
and  stimulus  factors,  without  significant  participation  of  the  Interval 
factor  (Table  4).  Rare  presentation  of  the  CS-  Increased  the  average 
standard  score  to  a  value  of  7.13  from  values  of  2.90  and  2.55,  In  the 
symmetrically  and  frequently  presented  CS-  conditions  respectively;  however, 
the  Individual  comparisons  for  these  effects  were  not  significant. 

Similarly,  the  average  standard  score  associated  with  rarely  presented  CS- 
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In  the  MON  was  11.92,  compared  to  6.84  and  5.52  for  the  symnetrlcally  and 
frequently  -presented  CS-,  respectively  (Figure  7,  right  panel,  fourth  row). 
However,  the  probability  manipulation  did  not  yield  significant  effects  In 
this  site,  again  perhaps  due  to  the  relatively  small  sample  of  MON  records. 

Unit  responses  to  the  CS+.  Rare  CS+  presentation  did  not  augment  the 
already  ample  responses  to  the  symmetrically  and  frequently  presented  CS+  In 
the  PCC,  AVN  and  MON.  However,  the  rare  CS+  did  Increase  significantly  the 
activity  elicited  In  the  ACC  and  DG.  Individual  comparisons  following  a 
significant  three-way  Interaction  In  the  analysis  of  the  ACC  data  Indicated 
that  the  response  to  the  rarely  presented  CS+  exceeded  the  responses  to  the 
frequently  and  symmetrically  presented  CS+s  from  50  to  400  ms  after  CS  onset 
(Table  4  and  Figure  7,  upper  left  panel).  Similarly,  the  unit  responses  ITT 
the  DG  to  both  the  rarely  and  symmetrically  presented  CS+s  exceeded  the 
responses  to  the  frequently  presented  CS+  In  several  10  ms  Intervals  from 
170  to  390  ms  (Table  4  and  Figure  7,  left  panel,  third  row).  Finally  and 
somewhat  paradoxically,  the  responses  of  the  AVN  to  both  the  rare  and  the 
frequent  CSs  were  reduced  significantly  relative  to  the  response  to  the 
symmetrically  presented  CS+  at  Intervals  from  170-300  ms  (Figure  7,  bottom 
left  panel).  A  similar  but  nonsignificant  reduction  was  also  exhibited  by 
the  MON  (Figure  7,  left  panel,  fourth  row). 

Macro potentials.  In  parallel  with  the  unit  results,  both  of  the  rare 
CSs  elicited  OG  macropotential  responses  of  greater  magnitude  than  those 
elicited  by  the  frequently  and  symmetrically  presented  CSs  (Figure  8,  row 
3).  These  results  were  Indicated  by  a  significant  Interaction  of  stimulus 
and  session  (£12/8]  •  5.29,  £-.0344) .  However,  no  Individual  comparisons 
attained  acceptable  significance  levels  for  this  effect,  and  none  of  the 
remaining  Intracranial  records  yielded  significant  macropotentlal  changes  In 
response  to  the  standard  AP  conditions. 

In  contrast,  robust  effects  of  the  probability  manipulation  occurred  In 
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the  anterior  and  posterior  surface  (AS  and  PS)  raacropotentlals,  as  indicated 
by  significant  3-way  Interactions  of  stage,  stimulus  and  Interval  (Table  5). 
The  AS  activity  elicited  by  the  rare  and  equal  CS+  exeeded  that  elicited  by 
the  frequent  CS+  at  several  Intervals,  from  60  to  140  ms  after  CS  onset. 

These  Intervals  correspond  to  the  large  positive  peak  In  the  AS  record 
(Figure  8,  upper  left  panels;  Table  5).  The  rare  and  equal  potentials  In 
this  Instance  were  of  nearly  Identical  amplitude  In  these  Intervals.  The  PS 
potential  elicited  by  the  rare  CS-*-  exceeded  significantly  the  potentials 
elicited  by  both  the  equal  and  frequent  CS-*-,  again  during  the  large  positive 
wave  from  60  to  200  ms  (Figure  8,  upper  right  panels). 

The  rare  CS-  also  elicited  significantly  greater  positive  potentials 
than  the  frequent  and  equal  CS-  In  the  AS  and  PS,  but  this  effect  was  of 
small  amplitude  and  It  occurred  In  a  rather  limited  range  of  Intervals 
(Table  5). 

4.  Nonstandard  AP  sessions 

Unit  activity  In  the  asymptotic  training  stage.  The  average  unit 
responses  were  attenuated  under  all  conditions  of  stimulation  during  the 
nonstandard  AP  sessions,  relative  to  the  responses  In  the  standard  AP 
sessions,  perhaps  due  to  the  contextual  alteration  represented  by  the  novel 
procedures,  or  to  the  refractoriness  of  the  unit  responses  produced  by  the 
brief  Interstimulus  Interval.  This  attenuation  notwithstanding,  clear 
effects  of  the  manipulation  of  stimulus  probability  and  relevance  were 
observed  during  these  session. 

The  rare  CS+  elicited  significantly  greater  excitatory  unit  responses 
than  the  frequent  CS-  during  the  asymptotic  training  stage  In  all  of  the 
Intracranial  regions  (Figure  9,  third  column).  In  all  sites  but  the  AVN, 
the  rare  CS-  and  the  frequent  CS+  elicited  statistically  equivalent,  minimal 
responses  (Figure  9,  lower  right  panel).  In  the  exceptional  case,  the  AVN, 
the  CS+  elicited  a  significantly  greater  response  than  the  CS-  regardless  of 
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the  presentation  frequency  (Figure  9,  upper  right  panel).  A  similar  pattern 
occurred-numerlcally,  but  not  significantly.  In  the  other  thalamic  nucleus, 
the  MON.  Thus,  the  combination  of  rareness  and  task  relevance  Increased  the 
neuronal  responses  In  all  areas. 

These  results  were  Indicated  by  significant  Interactions  of  the 
stimulus  and  Interval  factors  from  separate  analyses  performed  on  the  rare 
CS+  and  the  rare  CS-  sessions  during  the  asymptotic  training  stage  (Table 
6).  Separate  analyses  were  performed  because  about  half  of  the  animals  had 
only  the  rare  CS+  session  and  the  other  half  had  both  rare  CS+  and  rare  CS- 
sesslons.  The  effects  of  rare  CS+  presentations  In  asymptotically  trained 
rabbits  occurred  In  the  170  to  400  ms  range  In  all  sites  except  the  OG.  In 
this  site,  the  effect  consisted  of  a  greater  Inhibition  of  unit  firing  1nr~ 
response  to  the  rare  CS+  than  to  the  frequent  CS-  from  100-140  ms,  and  a 
greater  excitation  to  the  rare  CS+  than  to  the  frequent  CS-  from  240-400  ms. 

Macropotentlals.  The  general  pattern  of  the  unit  response  during  the 
nonstandard  AP  sessions  (l.e.,  an  enhancement  of  the  response  to  the  rare 
CS+)  was  also  observed  In  certain  macropotentlal  records,  including  the  two 
surface  records,  the  OG  and  the  MDN  records  (Figure  10).  Although  not 
significant,  this  pattern  was  also  suggested  by  the  average  macropotentlals 
In  the  AVN.  In  these  records,  the  significant  Interactions  and  Individual 
comparisons  (Table  7)  Indicated  that  the  macropotentlals  elicited  by  the  CS+ 
In  the  rare  CS+  sessions  were  of  significantly  greater  amplitude  than  those 
elicited  by  the  frequent  CS-,  but  there  were  no  significant  amplitude 
differences  in  the  sessions  with  the  rare  CS-  and  frequent  CS+. 

Activity  during  the  pretralnlnq  stage.  The  basic  effect  of  the 
probability  manipulation  during  pretraining  was  the  enhancement  of  thalamic 
neuronal  activity  to  rarely  presented  stimuli  (Figure  9,  upper  left  panels). 
This  effect  occurred  in  the  AVN,  as  Indicated  by  significant  Interactions  of 
the  stimulus  and  interval  factors,  when  the  rare  stimulus  was  the 
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prospective. CS+  (F(39/195l«2.67,  £<.0001),  and  when  It  was  the  prospective 
CS-  (£[39/U7j»5.67,  £<.0001).  In  addition,  the  effect  occurred  In  the  MON 
when  the  rare  stimulus  was  the  prospective  CS-  (£[39/1171-3.03,  £<.0001). 
Also,  the  unit  response  In  the  MON  to  the  rare  prospective  CS+  exceeded  that 
to  the  frequent  prospective  CS-  at  every  Interval,  but  due  to  the  snail 
sample  In  this  case,  this  difference  was  not  significant  In  the  analysis  of 
variance.  Finally,  In  contrast  to  the  significant  effects  of  the 
probability  manipulation  during  pretraining  In  the  thalamic  structures,  no 
significant  effects  occurred  In  the  cortical  structures  (Figure  9,  lower 
left  panels).  The  manipulation  of  stimulus  probability  during  the 
nonstandard  AP  treatments  did  not  significantly  affect  the  Intracranial 
macropotentlals  In  any  of  the  recording  sites  during  pretraining.  — - 

C.  OETAILEO  RESULTS  OF  EXP  2±  SU8ICULAR  LESIONS  AMO  ACTIVITY  IN  THE  PCC  AND 
AVN. 

1.  Unit  Activity. 

In  a  recent  study,  we  reported  that  sublcular  lesions  reduced  the 
magnitude  of  the  average  CS  elicited  unit  response  In  the  PCC,  and  they 
Increased  the  magnitude  of  the  overall  and  the  discriminative  unit  response 
In  the  AVN  (26,28).  The  reduced  PCC  response  occurred  In  the  early  training 
stages,  from  pretraining  to  the  session  of  criterion  attainment;  the  unit 
response  in  the  PCC  Increased  gradually  during  training,  such  that  the 
difference  between  the  records  In  rabbits  with  lesions  and  In  controls  was 
nonsignificant  during  and  after  criterion  attainment.  The  lesion- Induced 
enhancement  of  the  overall  and  discriminative  unit  response  In  the  AVN 
reached  Its  peak  magnitude  during  criterion  attainment  and  postcrlterlal 
overtraining.  These  results.  In  combination  with  the  lesion-induced  loss  of 
the  PCC  unit  response,  form  part  of  the  empirical  basis  for  a  theoretical 
model  of  limbic  system  functional  Interactions  during  learning  (29). 

Here  we  present  the  data  from  the  standard  AP  session  (given 
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Immediately  after  overtraining)  In  which  the  CS+  was  presented  rarely,  arm 
the  CS- -frequently.  These  data  are  relevant  to  the  possible  role  of 
hippocampal  projections  in  governing  the  PCC  and  AVN  neuronal  response  to 
the  CS  probability  manipulation.  Figure  11  shows  the  PCC  unit  responses  In 
Intact  controls  (upper  left  column)  and  In  rabbits  with  lesions  (upper  right 
column)  during  the  the  standard  AP  session.  The  figure  shows  a  reduced  PCC 
unit  response  in  the  rabbits  with  lesions  relative  to  controls.  This  effect 
was  not  significant,  although  as  noted,  the  PCC  responses  In  rabbits  with 
lesions  were  significantly  reduced  In  the  earlier  training  stages. 

The  enhancement  of  the  overall  and  discriminative  AVN  unit  response  in 
rabbits  with  lesions,  present  during  overtraining,  also  occurred  during  the 
standard  AP  session  as  shown  In  the  lower  half  of  Figure  11.  This 
conclusion  was  Indicated  by  a  significant  Interaction  of  the  lesion, 
stimulus  and  Interval  factors  (F[39/5461»2.77,  £<.0001)  In  the  analysis  of 
the  standard  AP  session  data.  Individual  comparisons  Indicated  no 
significant  differences  between  the  lesion  and  control  unit  responses  to  the 
CS-.  However,  the  CS+  In  rabbits  with  lesions  elicited  a  significantly 
greater  average  unit  response  than  the  CS+  In  controls  from  160-400  ms  after 
CS  onset.  Thus,  the  magnitude  of  the  neuronal  response  to  the  rare  CS+  In 
the  AVN  In  rabbits  with  lesions  was  significantly  greater  than  Its  magnitude 
In  control  rabbits.  The  fact  that  the  response  to  the  rare  CS+  and  frequent 
CS-  presentation  In  the  rabbits  with  lesions  was  not  greater  than  the 
response  to  the  equlprobably  presented  CS+  In  the  preceding  overtraining 
session  Indicates  that  the  response  to  the  CS+  was  already  enhanced  during 
this  session,  relative  to  the  control  response,  and  It  was  not  further 
Incremented  by  the  rare  presentation  of  the  CS+  In  the  standard  AP  session. 
This  result  Is  perhaps  to  be  expected  from  the  data  of  Exp  1,  Indicating 
that  AVN  neurons  In  Intact  rabbits  did  not  show  a  specific  response  to  the 
rarely  presented  CS+  during  overtraining. 
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2.  Macropotentials 

The  overall  macropotential  amplitude  and  .he  magnitude  of  the 
discriminative  macropotentlal  response  In  the  PCC  were  enhanced  In  the 
rabbits  with  lesions,  relative  to  controls,  during  overtraining  (Figure  12). 
However,  no  further  alteration  occurred  during  the  following  session  in 
which  the  CS+  was  presented  rarely.  The  effects  of  the  lesions  during 
overtraining  were  Indicated  by  a  significant  Interaction  of  the  lesion, 
stimulus  and  Interval  factors  (F( 39/13261-1. 81,  £<.0019).  Individual 
comparisons  demonstrated  enhanced  responses  In  the  brief-latency  positive 
component  from  60-80  ms  after  CS  onset,  and  In  the  longer  latency  negative 
component  from  170  to  400  ms.  That  Is,  the  positive  wave  elicited  In 
response  to  the  CS+  was  significantly  greater  than  the  positive  wave 
elicited  by  the  CS-,  and  the  negative  wave  elicited  by  the  CS+  was 
significantly  greater  than  the  negative  wave  elicited  by  the  CS-  (compare 
the  profiles  In  the  left  and  right  upper  panels  In  Figure  12).  Both  lesion 
and  control  rabbits  exhibited  enhanced  average  macropotentlals  In  the  PCC  In 
response  to  the  rare  CS+,  relative  to  the  potentials  elicited  by  the 
frequently  presented  CS-  during  the  standard  AP  session  that  followed 
overtraining.  This  outcome  was  Indicated  by  a  significant  Interaction  of 
stimulus  and  Interval  (F 1 39/858 1 -8. 79;  £<.0001).  Again  the  enhancement 
occurred  In  the  brief-latency  positive  deflection  and  In  the  longer  latency 
negative  deflection.  However,  the  absence  of  any  significant  Influence  of 
the  lesion  factor  In  this  analysis  suggested  that  the  presentation  of  the 
rare  and  relevant  CS+  yielded  a  differential  response  In  controls  that  was 
of  the  same  magnitude,  statistically,  as  the  differential  response  In  the 
rabbits  with  lesions  (compare  the  profiles  In  the  left  and  right  lower 
panels  In  Figure  12). 

Thus  far,  the  data  suggest  that  sublcular  lesions  alter  the  overall 
and  discriminative  response  magnitude,  but  not  the  response  to  rare  stimulus 
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presentation.  The  only  disclaimer  to  this  conclusion  was  yielded  by  the  AVN 
macropotential  data  (Figure  12).  Here,  the  sublcular  lesions  Increased  the 
overall  macropotentlal  response  of  the  AVN  during  overtraining  as  Indicated 
by  a  significant  Interaction  of  the  lesion  and  Interval  factors 
(£139/7801-5.03,  £<.0001,  upper  panels  of  Figure  13).  Individual 
comparisons  showed  the  average  macropotentlal  response  to  be  significantly 
more  positive  70  ms  after  CS  onset,  and  significantly  more  negative  from  150 
ms  to  the  end  of  the  analysis  period.  In  the  rabbits  with  lesions,  relative 
to  controls.  The  absence  of  an  effect  of  the  stimulus  factor  In  this 
analysis  suggested  that  the  lesions  did  not  alter  the  magnitude  of  the 
discriminative  response.  Vet,  both  the  overall  and  discriminative  response 
were  Increased  by  the  lesions  during  the  standard  AP  session,  as  Indicated-- 
by  an  Interaction  of  lesion  stimulus  and  Interval  (£[39/5461*7.16,  £<.0001; 
lower  panels.  Figure  13).  Individual  comparisons  Indicated  that  there 
occurred  a  dynamic  negative  macropotentlal  response  to  the  rare  CS+,  and  no 
response  to  the  frequent  CS-  In  the  rabbits  with  lesions  (lower  right  panel. 
Figure  13).  In  contrast,  the  AVNs  In  controls  exhibited  no  trace  of  a 
discriminative  macropotentlal  response  during  the  standard  AP  session  (lower 
left  panel,  Figure  13).  These  data  represent  the  only  Instance  In  this 
study  of  an  altered  neural  sensitivity  to  the  manipulation  of  CS  probability 
In  rabbits  with  lesions.  The  observed  alteration  Is  an  unexpected 
enhancement  of  the  probability  response. 

To  summarize,  the  present  results  are  In  conformity  with  recent 
findings  Indicating  that  hippocampal  projections  to  the  AVN  are  Involved  in 
limiting  the  discriminative  unit  and  macropotentlal  responses  in  this 
structure,  as  well  as  the  discriminative  macropotentlal  response  In  the  PCC. 
The  hippocampal  projections  may  also  limit  the  AVN  macropotentlal  response 
to  rare  CS+  presentation  In  Intact  animals.  These  data  contradict  the 
hypothesis  that  the  neural  response  In  the  PCC  and  AVN  to  rare  and  relevant 
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CS  presentations  require  Intact  projections  from  the  hippocampus. 

DISCUSSION 

This  investigation  is  directed  toward  establishing  whether  the  study 
of  neural  activity  during  avoidance  conditioning  In  rabbits  will  aid  the 
search  for  the  neural  origins  of  ERPs  In  humans.  The  criterion  that  we 
apply  to  this  question  is  the  degree  to  which  neural  activity  In  rabbits 
exhibits  functional  properties  that  are  analogous  to  ERPs  In  humans.  It 
should  be  stressed  that  at  this  time  we  are  looking  for  analogy  In  terms  of 
the  way  In  which  the  activity  In  people  and  In  rabbits  Is  altered  by 
manipulation  of  the  eliciting  conditions.  Whether  the  activities  are  also 
homologous  ( 1 . e. .  Identical  In  terms  of  their  evolutionary  origins)  remains 
to  be  seen. 

Past  studies  of  limbic  neural  activity,  and  the  present  data, 
demonstrate  target  effects,  l.e. ,  the  development  during  training,  of 
different  neural  responses  to  the  positive  CS,  relative  to  responses 
elicited  by  the  negative  CS.  Here  we  demonstrate  In  addition  that  the  unit 
and  macropotentlal  responses  In  these  structures  are  significantly  altered 
by  the  manipulation  of  CS  probability.  Since  target  and  probability  effects 
are  hallmarks  of  the  P300,  these  data  satisfy  the  primary  criteria  for  the 
analogy  that  we  are  seeking.  Furthermore,  the  fact  that  the  target  and 
probability  effects  occurred  In  terms  of  the  firing  of  neurons  localizes 
these  effects  to  the  monitored  structures.  The  Involved  structures  thus 
become  candidates  for  neural  generation  of  the  human  potentials. 

The  principal  reason  for  seeking  an  animal  model  for  the  P300  Is  not 
only  to  demonstrate  and  localize  activity  analogous  to  the  P300.  Indeed, 
that  objective  Is  Important  primarily  as  a  stepping  stone  to  the 
prograimeatlc  Invasive  Investigation  of  the  neural  causal  antecedents  and 
consequences  of  the  analogous  activity.  This  consideration  brings  to  light 
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an  additional  use  of  animal  models:  to  provide  a  mapping  of  the  central 
neural  structures  that  exhibit  ERP  functional  properties,  and  to  document 
regional  differences  In  that  expression.  If  sensitivity  to  stimulus 
relevance  and  probability  were  to  be  exhibited  Identically  In  every 
structure,  under  all  conditions  of  training  and  testing,  the  conduct  of 
programmatic  Invasive  research  would  not  be  greatly  facilitated.  More 
useful  would  be  preliminary  findings  Indicating  the  differential  expression 
of  phenomena  analogous  to  the  P300  In  various  areas  of  the  brain.  This 
differential  expression  was  found  In  the  present  study:  the  effects  of  the 
probability  manipulation  depended  on  the  recording  site  and  the  testing 
procedure.  When  the  probability  of  the  CSs  was  manipulated  during  standard 
AP  sessions  In  fully  trained  rabbits,  rare  presentation  of  the  CS-  enhanceTT 
the  unit  discharge  to  that  stimulus  In  all  of  the  recording  sites.  However, 
rare  presentation  enhanced  the  response  to  the  CS+  In  only  two  of  the  sites, 
the  ACC  and  the  OG.  One  of  the  remaining  sites  (the  PCC)  showed  no  change 
In  response  to  rare  CS+  presentation,  and  the  other  two  (the  AVN  and  the 
MON)  showed  suppression  of  unit  activity  In  response  to  the  rare  CS+. 

Past  results  provide  some  clues  to  the  possible  reasons  for  these 
regional  differences.  For  example,  consider  the  suppression  of  firing  In 
the  AVN  and  MDN  In  response  to  rare  CS+  presentation  (Figure  7).  In  the 
AVN,  the  rare  and  the  frequent  CS+s,  both  unusual  stimuli,  reduced  the 
response  relative  to  the  response  evoked  by  the  CS+  when  It  and  the  CS-  were 
equally  probable.  The  response  reduction  In  the  MDN  occurred  relative  to 
the  response  to  the  frequently  presented  CS+  and  relative  to  the  CS+ 
presented  equally  as  often  as  the  CS-.  These  reductions  of  the  thalamic 
response  can  be  viewed  as  being  compatible  with  a  recent  theoretical  model, 
which  holds  that  the  cingulate  cortices  exert  a  limiting  Influence  on  the 
thalamic  nuclei  with  which  they  are  Interconnected  (29).  One  of  the  factors 
that  Invokes  this  limiting  Influence  Is  the  occurrence  of  unexpected  events 
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(such  as  altered  stimulus  Incidence),  and  the  result  of  limiting  Is  a 
prevention -of  pre- prog ramned  behaviors.  This  prevention  Is  one  of  several 
processes  that  are  Invoked,  hypothetically,  to  promote  behavioral  quiescence 
and  the  direction  of  attention  to  the  novel  events.  Numerous  findings 
provide  the  Inferential  basis  of  this  model.  Including  the  recent  data 
Indicating  that  the  target  effect  In  the  AVN,  and  behavioral  responding,  are 
enhanced  In  rabbits  with  lesions  In  the  PCC  and  also  the  hippocampus,  the 
sole  sources  of  cortical  input  to  the  AVN  (26,28).  The  enhancement  of 
behavioral  responding  appeared  as  a  hyper-response  during  extinction 
training,  when  rabbits  with  Intact  corticothalamic  connections  exhibited 
suppressed  behavioral  and  AVN  responses.  Experiments  to  determine  whether 
the  MON  response  Is  enhanced  In  rabbits  with  damage  In  the  reciprocally  ~ 
Interconnected  ACC  are  currently  In  progress.  In  any  event,  the  basic 
premise  that  arises  from  these  data  Is  that  the  limiting  process  may  be 
responsible  for  the  absence  of  unit  response  Increments  In  the  AVN  and  the 
MON  under  conditions  of  rare  CS+  presentation. 

The  situation  Is  somewhat  more  complex  with  regard  to  the  PCC.  The 
layer  VI  (corticothalamic)  neurons  In  this  region  should,  hypothetically,  be 
the  origins  of  the  limiting  Influence.  Thus,  they  should  be  activated  by 
the  presentation  of  unexpected  events.  However,  the  PCC  Is  also  directly 
activated  by  afferents  from  the  AVN,  the  region  that  is  limited  when 
unexpected  events  occur.  Since  the  AVN  Is  somewhat  suppressed  by  virtue  of 
the  limiting  effect,  the  net  effect  In  the  PCC  could  be  no  significant 
change  In  the  response  due  to  the  alteration  of  CS+  probability.  This  is 
indeed  the  outcome  that  we  observed.  Nevertheless,  the  nonsignificant 
Increase  of  the  PCC  unit  response  to  the  rare  and  frequent  CSs,  relative  to 
the  equal  CS,  are  In  accord  with  the  Idea  that  this  region  Is  the  site  of 
the  neuronal  activity  that  limits  the  firing  of  the  AVN. 

The  unit  responses  to  the  CS-  underwent  a  gradual  reduction,  as  the 
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target  effect  developed  during  training.  However,  the  reduction  was 
reversed-and  the  response  to  the  CS-  dramatically  enhanced  in  all  of  the 
monitored  areas,  during  the  sessions  In  which  the  CS-  was  presented  rarely. 
Yet,  In  no  case  did  the  frequent  presentation  of  the  CS-  also  lead  to  an 
enhancement  of  the  evoked  response.  We  can  say  with  some  confidence  that 
the  enhancement  of  the  response  to  the  CS-  in  the  thalamic  sites  was  not  due 
to  a  lessening  of  the  limiting  process.  The  PCC  and  sublcular  lesions  that 
yielded  the  phenomenon  (enhancement  of  the  AVN  response)  giving  rise  to  the 
concept  of  a  limiting  process,  did  not  Increase  the  AVN  response  to  the  CS-, 
they  only  increased  the  response  to  the  CS+  (op.  clt.).  These  results,  and 
the  absence  of  any  response  enhancement  due  to  frequent  CS-  presentation, 
suggest  that  the  Increased  response  to  the  rare  CS-  was  not  due  to  a 
lessening  of  the  limiting  Influence  from  the  limbic  cortex.  At  present  we 
do  not  know  the  origin  of  this  effect. 

In  contrast  to  the  AVN  and  the  MON,  the  ACC  and  the  OG  have  been 
observed  In  past  studies  to  exhibit  target  effects  predominantly  In  the 
early  stages  of  acquisition  and.  In  the  case  of  the  OG  and  other  hippocampal 
areas.  In  response  to  the  alteration  of  familiar  training  contingencies 
(29).  These  sites  exhibit  a  lessening  of  the  target  responses  during  the 
asymptotic  stages  of  training  (25,29,50).  The  early  and  temporary  nature  of 
the  responses  In  these  structures  suggests  that  these  structures  are 
Involved  In  processes  that  are  Important  during  the  Initial  stages  of 
acquisition,  and  during  adaptation  to  the  occurrence  of  novel  task 
contingencies,  but  that  they  are  not  partlculary  Important  In  relation  to 
performance  of  the  well -practiced  habit.  Thus,  although  we  cannot  at 
present  provide  a  mechanistic  account  of  their  functions,  It  Is  perhaps  not 
surprising  that  rare  presentation  enhanced  the  response  to  both  CS+  and  CS- 
in  these  structures.  Indeed,  the  available  data  suggest  that  these 
structures  should  be  quite  sensitive  to  novel  events,  as  they  appear  to  be 
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Involved  In  the  neural  reprogramming  that  Is  Invoked  whenever  the  subject  is 
faced  with  the  need  to  adapt  to  novel  task  contingencies. 

Olfferent  effects  of  the  probability  manipulation  occurred  during  the 
nonstandard  AP  sessions.  In  which  the  tones  were  presented  at  a  1/sec.  rate, • 
with  response  prevention  and  no  US.  In  these  sessions,  the  neuronal 
responses  In  all  of  the  recording  sites  were  attenuated  relative  to  the 
response  magnitudes  during  the  standard  training  and  AP  test  sessions. 
Nevertheless,  the  combination  of  rareness  and  behavioral  relevance  was 
associated  with  significantly  greater  unit  and/or  macropotentlal  responses 
than  any  of  the  remaining  three  combinations  of  stimulus  properties. 

Perhaps  only  the  rare,  relevant  stimuli  had  sufficient  eliciting  capacity  to 
override  the  attenuating  effects  of  the  unusual  and  unexpected  properties'^ 
the  nonstandard  sessions.  Thus,  the  different  effects  of  the  nonstandard  AP 
treatment,  relative  to  the  standard  AP  treatment,  may  be  attributable  solely 
to  the  disruptive  effects  of  the  nonstandard  treatments,  rather  than  some 
special  variety  of  Information  processing  that  Is  Invoked  by  one  or  more  of 
the  parameters  of  the  nonstandard  sessions. 

The  present  results  Indicate  that  the  effects  of  relevance  and 
probability  manipulation  are  found  ubiquitously  throughout  the  limbic 
telencephalon  and  diencephalon.  Whereas  there  Is  no  reason  to  assume  that 
all  of  these  limbic  sites  contribute  to  fche  P300  In  humans,  these  results 
add  plausibility  to  the  hypothesis  that  the  ERPs  recorded  from  the  human 
scalp  are  manifestations  of  Information  processing  activities  that  are 
rather  widely  distributed  In  the  brain,  rather  than  processes  originating  in 
a  single  structure,  such  as  the  hippocampus.  The  widespread  nature  of  these 
effects  In  the  rabbit  Is  In  agreement  with  suggestions  from  other  recent 
studies  In  animal  and  human  subjects  (43,51,71).  Nevertheless,  It  Is  quite 
possible  that  only  a  very  limited  portion  of  the  widespread  Intracranial 
activity  has  the  necessary  biophysical  properties  that  would  allow  Its 
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activity  has  the  necessary  biophysical  properties  that  would  allow  its 
fields  to  be  recorded  on  the  scalp.  The  resolution  of  this  Issue  depends  on 
the  concurrent  study  of  multiple  Intracranial  locations  and  the  scalp  sites. 
Yet,  the  present  and  past  work  with  the  rabbit  preparation  do  seem  to 
provide  some  Important  Information  in  regard  to  this  issue. 

It  Is  now  well-known  that  the  cingulate  cortical  excitatory  discharges 
that  occur  from  100-400  ms  after  CS  onset  are  critically  dependent  on 
afferents  from  the  limbic  thalamic  nuclei:  lesions  In  these  nuclei 
eliminate  all  CS  elicited  excitatory  and  discriminative  cingulate  cortical 
activity  after  50  ms  following  CS  onset  (7,24).  In  the  present  study,  clear 
neuronal  sensitivity  to  stimulus  probability  and  relevance  occurred  In  the 
thalamic  nuclei,  and  these  effects  also  occurred  In  the  cingulate  cortical- 
areas  driven  by  the  limbic  thalamic  afferents.  Also,  as  we  have  seen  from 
past  work  and  from  the  data  of  the  present  Experiment  2,  the  thalamic 
manifestations  of  sensitivity  to  stimulus  relevance  and  probability  are  not 
dependent  on  Intact  connections  from  the  cortical  areas  (26,28).  These 
considerations.  In  concert,  raise  the  possibility  that  the  sensitivities  to 
stimulus  probability  and  stimulus  relevance  manifested  by  surface  recordings 
made  on  the  human  scalp  may  be  a  product  of  processes  of  the  limbic  and 
other  nonspecific  nuclei  of  the  thalamus.  It  Is  after  all  the  case  that  the 
robust  volleys  of  neuronal  action  potentials  and  ERPs  recorded  at  relatively 
brief  latency  from  the  cortex  and  from  the  scalp  reflect  In  large  part  the 
massive  driving  of  cortical  cells  by  thalamocortical  axons.  Thus,  it  seems 
quite  plausible  to  consider  the  possibility  of  a  thalmlc  origin  for  the 
P300.  Of  course,  this  view  does  not  deny  the  possibility  that  the  P300 
recorded  from  the  scalp,  although  driven  largely  by  thalamic  cells,  can 
nevertheless  by  modulated  Importantly  by  cortl co-cortlcal  Influences,  such 
as  those  that  originate  In  the  sublculum  of  tht  hippocampal  formation.  In 
fact,  the  model  of  limbic  participation  in  the  avoidance  task  that  we  have 
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mentioned  previously  associates  the  occurrence  of  rare  stimuli  with  the 
el  1c1tat4on-of  a  limiting  process,  represented  by  cingulate  cortical  unit 
activity  that  Is  thalamlcally  driven,  but  that  also  requires  an  Input  to 
cingulate  cortex  from  the  hippocampus.  Indeed,  the  macropotentlal  responses 
In  the  hippocampal  dentate  gyrus  elicited  by  rare  CS  presentations  exhibited 
greater  amplitude  and  duration  than  the  potentials  In  any  of  the  other 
studied  areas,  and  the  latency  of  the  response  to  the  rare  CS*  was  briefer 
than  in  the  other  areas. 

The  assertion  that  responses  analogous  to  the  P300  may  depend  on  the 
output  of  the  hippocampal  formation  may  seem  to  be  at  odds  with  other  data 
(51),  and  with  the  results  of  the  present  Exp  2,  suggesting  that  neural 
responses  to  stimulus  relevance  and  probability  manipulations  are  retained 
and  possibly  enhanced  In  animals  with  hippocampal  damage.  However,  In  our 
view,  these  propositions  are  not  necessarily  Incompatible.  The  P300  In 
Intact  humans  may  represent  primarily  the  evocation  of  the  limiting  process 
brought  about  by  the  Interaction  of  thalamic  and  hippocampal  Influences 
within  the  cortical  areas  that  receive  hlppoca^jal  projections.  In  animals 
with  hippocampal  damage,  the  activity  analogous  to  the  P300  may  reflect 
primarily  thalamocortical  neural  volleys  In  these  same  cortical  areas  in 
response  to  rare  and/or  relevant  stimuli.  The  fact  that  these  responses  are 
enhanced  In  the  braln-damaged  animals  may  Indeed  be  due  to  the  absence  of 
the  limiting  process,  the  very  process  that  Is  responsible  for  the  effect  in 
the  Intact  human. 

These  suggestions  must  of  course  be  tempered  by  the  observation  that 
the  morphology  of  the  ERPs  recorded  from  the  brain  surface  In  the  rabbit, 
which  Is  sensitive  to  event  probability.  Is  quite  distinct  from  the 
morphology  of  the  human  P300.  The  potentials  In  the  rabbit  are 
negative-going  In  the  very  same  latency  range  In  which  the  potentials  in 
humans  are  positive.  A  morphology  similar  to  the  P300  appears  in  the  rabbit 
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much  earlier,-  from  100  to  200  ms  after  stimulus  onset.  At  present.  It  Is 
impossible  to  state  which  of  the  potentials  recorded  In  rabbits.  If  any,  is 
the  better  analog  of  the  human  P300. 

In  this  connection  It  should  be  noted  that  there  Is  no  reason  to 
believe  that  the  morphology  of  the  waves  should  be  similar  In  two  such 
different  species.  It  Is  possible  that  the  negative  potential  at  300  ms  is 
analogous  to  the  positive  potential  at  that  latency  In  humans  and  that  the 
difference  In  polarity  Is  the  result  of  relatively  trivial  difference  In  the 
orientation  of  the  electrical  sources,  or  some  other  trivial  difference 
between  humans  and  animals.  On  the  other  hand.  It  Is  possible  that  the 
neural  activity  that  gives  rise  to  the  P300  In  humans  may  originate  In  a 
system,  such  as  the  limbic  system,  that  begins  to  exhibit  sensitivity  to 
stimulus  probability  and  relevance  long  before  300  ms.  These  manifestations 
may  not  appear  on  the  human  scalp  until  300  ms  has  elapsed  because  of 
various  fortuitous  voltage  cancellation  effects  that  occur  In  the  earlier 
latencies.  Such  effects  are  In  fact  likely  given  the  prevalence  of  several 
cascaded  excitatory  and  Inhibitory  discharges  that  characterize  stimulus 
processing  In  a  variety  of  neural  structures  In  the  brief  latency  domain. 

In  this  connection  It  Is  relevant  to  note  that  the  occurrence  of  probability 
and  stimulus  relevance  effects  at  latencies  considerably  under  300  ms  has 
been  found  In  studies  of  human  and  animal  Intracranial  potentials 
(4,46,46,71).  These  data  favor  the  second  Interpretation  outlined  above. 
However  this  Issue  Is  ultimately  resolved,  our  data  clearly  establish  that 
the  rabbit  can  be  used  as  a  subject  In  the  pursuit  of  a  model  of  the  P300. 
This  Is  welcome  news  because  the  rabbit  affords  a  most  convenient 
preparation  for  Investigating  the  neuroanatoey  and  functional 
neurophysiology  of  the  phenomena  analogous  to  the  P300. 
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FIGURE  CAPTIONS 

Figure  1.  A  set  of  six  coronal  sections  of  the  rabbit  brain  Illustrating 
the  minimal  (dark  areas)  and  maximal  (hatched  areas)  extents  of  the 
subicular  lesions.  (AP  *  anterior/posterior  location  of  the  depicted 
sections  In  millimeters.)  The  depictions  are  derived  from  the  atlas  of 
Flfkova  &  Marsala,  presented  In  Bures  et  al.,  1967. 

Figure  2.  Average  of  the  unit  responses  to  the  CS-  during  the  standard  AP 
(asymnetrlc  probability)  sessions.  The  responses  In  each  recording  site 
were  elicited  In  trained  rabbits  during  separate  sessions  In  which  the  C$- 
was  presented  In  .2,  .5  or  .8  of  the  trials.  The  plotted  values  were 
obtained  by  averaging  the  response  magnitudes.  In  the  form  of  z- scores,  of  a 
set  of  10  consecutive  10-msec  Intervals  containing  the  peak  response  of  each 
of  the  sites.  The  10-msec  Interval  between  80  and  400  msec  In  which  the 
z-score  was  the  first,  after  CS  onset,  to  fall  within  the  95%  confidence 
Interval  of  the  largest  z-score  for  a  given  profile  was  selected  as  the 
initial  Interval  of  the  window.  The  recording  sites  are  Indicated  by  the 
arrows  referencing  an  artistic  depiction  of  a  sagittal  section  of  the  rabbit 
brain  at  approximately  1.5  mm  from  the  mldllne.  The  depiction  Is  based  on 
the  atlas  of  Shek,  Wen  and  Wisniewski,  1986. 

Figure  3.  Average  of  the  unit  responses  to  the  CS+  during  the  standard  AP 
(asymmetric  probability)  sessions.  The  responses  In  each  recording  site 
were  elicited  In  trained  rabbits  during  separate  sessions  in  which  the  CS+ 
was  presented  in  .2,  .5  or  .8  of  the  trials.  The  artistic  depiction  of  the 
sagittal  brain  section  and  the  method  for  obtaining  plotted  values  are 
described  In  the  legend  of  Figure  2. 

Figure  4.  The  average  unit  responses  during  the  non-standard  AP  sessions 


that  followed  the  standard  overtraining  sessions.  The  reponses  were 
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elicited  following  the  presentation  of  the  CS+  and  the  CS-  In  asymmetric 
proportions- (.2/. 8:  (Rare  CS+/Frequent  CS-],  or,  .8/. 2:  (Frequent  CS+/Rare 
CS- 1  - )  The  artistic  depiction  of  the  sagittal  brain  section  and  the  method 
for  obtaining  the  plotted  values  are  described  In  the  legend  of  Figure  2. 

Figure  5.  Average  unit  responses  in  the  form  of  z-scores  derived  from  the 
Intracranial  sites  during  pretraining  (left  column)  and  during  the 
asymptotic  training  stage  ("trained";  right  column.)  The  responses  were 
elicited  following  presentations  of  the  CS+  (solid  line)  and  the  CS-  (dashed 
line)  In  equal  (.5/. 5)  proportions.  (Oata  are  shown  for  each  of  the 
recording  sites  In  40  consecutive  10-msec  Intervals.) 

figure  6.  Average  macropotentlal  responses  In  the  form  of  z- scores  derived^ 
from  the  records  of  the  two  brain  (epidural)  surface  sites  (two  upper  rows) 
and  four  Intracranial  sites  during  pretraining  (left  column)  and  during  the 
asymptotic  training  stage  ("Trained";  right  column).  The  responses  were 
elicited  following  presentations  of  the  CS+  (solid  line)  and  CS-  (dashed 
line)  presented  In  symmetric  (.5/. 5)  proportions.  Oata  are  shown  for  six 
recording  site  In  40  consecutive  10-msec  Intervals.  Positive  voltage 
responses  are  Indicated  by  downward  deflections.  The  average 
macropotentials  obtained  from  the  medial  dorsal  nucleus,  very  similar  to 
those  shown  in  the  figure  for  the  anterior  ventral  nucleus,  were  thus 
omitted  for  brevity. 

Figure  7.  Average  unit  responses  In  the  form  of  z-scores  derived  from  the 
five  Intracranial  records  during  the  standard  AP  (asymmetric  probability) 
sessions.  Oata  are  shown  for  each  recording  site  In  40  consecutive  10-msec 
Intervals.  The  responses  were  elicited  by  the  CS+  (left  column)  and  CS- 
(rlght  column)  In  synmetrlc  (.5/. 5)  and  asymmetric  (.2/. 8  and  .8/. 2) 
proportions.  The  responses  elicited  by  the  rare,  equal  and  frequent  CSs  are 
indicated  by  the  solid,  dashed  and  dotted  lines  respectively. 
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Figure  8-.  Average  macropotential  responses  In  the  form  of  2-scores  derived 
from  the  two  surface  records  (top  row)  and  from  the  five  Intracranial 
recording  sites  during  the  standard  AP  (asymmetric  probability)  sessions. 
Data  are  shown  for  each  recording  site  in  40  consecutive  10-msec  intervals. 
The  responses  were  elicited  by  the  CS+  (first  and  third  columns)  and  CS- 
(second  and  fourth  columns)  In  symmetric  (.5/. 5)  and  asymmetric  (.2/. 8  and 
.8/.  2)  proportions.  The  responses  elicited  by  the  rare,  equal  and  frequent 
CSs  are  Indicated  by  the  solid,  dashed  and  dotted  lines  respectively. 

Figure  9.  Average  unit  responses  In  the  form  of  z-scores  derived  from 
records  of  the  anterior  ventral  thalamic  nucleus  and  the  anterior  cingulate 
cortex  during  the  non-standard  AP  training  procedure  which  followed 
pretraining  and  asymptotic  training  (“trained").  The  responses  were 
elicited  by  the  CS+  (solid  lines)  and  CS-  (dashed  lines)  presented  In 
asymmetric  proportions  (.2/. 8:  (Rare  CS+1  or  .8/. 2:  (Rare  CS- ) ) .  The  data 
of  the  posterior  cingulate  cortex  and  medial  dorsal  nucleus,  very  similar 
respectively  to  the  data  of  the  anterior  cingulate  cortex  and  anterior 

ventral  thalamic  nucleus  In  the  figure,  were  thus  omitted  for  brevity. 

Figure  JU).  Average  macropotentlal  responses  In  the  form  of  z-scores  derived 
from  the  two  surface  records  (top  row)  and  from  the  five  Intracranial 
recording  sites  during  the  nonstandard  AP  (asynwetrlc  probability)  sessions. 
Oata  are  shown  for  each  recording  site  In  40  consecutive  10-msec  Intervals. 
The  responses  were  elicited  by  the  CS+  (solid  lines)  and  CS-  (dashed  lines) 
In  asymmetric  proportions.  These  proportions  were  .2/.8  (rare  CS-*-;  left 
columns)  and  .8/. 2  (rare  CS-;  right  columns). 

Figure  XL-  Average  unit  responses  In  the  form  of  z-scores  derived  from  the 
records  of  the  posterior  cingulate  cortex  (upper  row)  and  the  anterior 

ventral  thalamic  nucleus  (lower  row)  In  controls  (left  column),  and  In 
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rabbits  with  sublcular  lesions  (right  column).  The  responses  were  elicited 
by  the  rarefy  presented  CS+  (solid  bars)  and  by  the  frequently  presented 
CS-  (open  bars)  during  the  standard  A P  session  presented  on  the  day  after 
the  final  (third)  overtraining  session  In  EXP  2. 

Figure  12.  Average  macropotential  responses  In  the  form  of  z-scores  derived 
from  the  records  of  the  posterior  cingulate  cortex  In  controls  (left  column) 
and  In  rabbits  with  sublcular  lesions  (right  column)  during  overtraining 
(upper  row)  and  during  the  standard  AP  session  (lower  row)  given  on  the  next 
day.  The  responses  were  elicited  by  the  CS+  (solid  lines)  and  CS-  (dashed 
lines)  In  syimwtrlc  (.5/. 5)  and  asymmetric  (.2/. 8)  proportions. 

Figure  13.  Average  macropotentlal  responses  In  the  form  of  z-scores  derive*! 
from  records  of  the  anterior  ventral  thalamic  nucleus  In  controls  (left 
column)  and  In  rabbits  with  lesions  of  the  dorsal  subclular  complex  (right 
column).  The  data  were  obtained  during  overtraining  (upper  row)  and  a 
subsequent  standard  AP  session  (lower  row).  The  responses  were  elicited 
following  presentation  of  the  CS+  (solid  lines)  and  CS-  (dashed  line)  In 
symmetric  (.5/. 5)  and  asymmetric  (.2/. 8)  proportions. 
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TABLE  CAPTIONS 

Table  l.  The  number  of  unit  and  macropotential  records  are  given  for  each 
brain  area  and  training  stage.  In  the  case  of  the  nonstandard  AP 
(asymnetrlc  probability)  sessions  (right  portion  of  table),  the  number  of 
unit  records  equaled  the  number  of  macropotentlal  records  within  each  cell, 
except  in  the  two  cells  In  which  two  entries  are  given.  In  these  cells,  the 
first  number  represents  the  unit  records  and  the  second,  the 
macropotentlal s. 

Table  2.  UNIT  ACTIVITY  DURING  STANOARO  TRAINING:  Probability  levels,  F 
ratios,  and  statistically  significant  (j><  .05)  Individual  co*>ar1sons 
derived  from  analyses  of  the  Interactions  of  training  stage  X  Interval  and 
training  stage  X  stimulus  X  Interval  In  the  unit  records  of  five  recording 
sites  during  standard  training.  The  Intervals  (In  milliseconds)  In  which 
responses  were  significantly  greater  during  the  asymptotic  training  stage 
than  during  pretraining  are  listed  In  the  third  column.  Intervals 
containing  greater  responses  to  the  CS+  than  to  the  CS-  are  listed  In  the 
fifth  and  sixth  columns.  (N  ■  number  of  records,  NS  »  not  significant,  *  » 
the  response  to  the  CS-  exceeded  the  response  to  the  CS+;  see  Table  1  for 
the  recording  site  labels.) 

Table  3.  INTRACRANIAL  MACROPOTENTIALS  DURING  STANOARO  TRAINING:  Probability 
levels,  F  ratios,  and  statistically  significant  (j><  .05)  Individual 
comparisons  derived  from  analyses  of  the  Interactions  of  training  stage  X 
Interval,  and  training  stage  X  stimulus  X  Interval  In  the  macropotentlal 
records  of  the  PCC,  DG,  AS  and  PS  during  standard  training.  The  Intervals 
(In  milliseconds)  In  which  the  potentials  were  significantly  Increased 
during  the  asymptotic  training  stage  relative  to  the  pretraining  stage,  are 
given  In  the  third  column.  Intervals  In  which  the  potentials  elicited  by 
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the  CS+  were  significantly  greater  than  the  potentials  elicited  by  the  CS- 
are  given  In  the  fifth  and  sixth  columns.  (*»the  potentials  elicited  by  the 
CS-  exceeded  those  elicited  by  the  CS+;  N-number  of  records;  NS-not 
significant.) 

Table  4.  UNIT  ACTIVITY  DURING  THE  STANOARO  AP  SESSIONS:  Probability 
levels,  F  ratios,  and  statistically  significant  (£<  .05)  Individual 
comparisons  derived  from  analyses  of  the  Interactions  of  training  stage  x 
stimulus,  and  training  stage  X  stimulus  X  Interval,  In  the  unit  records  of 
the  five  recording  sites  during  the  standard  AP  session.  The  Intervals  (In 
milliseconds)  In  which  responses  were  significantly  altered  by  the 
probability  manipulation  are  listed  for  each  comparison.  Significantly 
greater  responses  occurred  In  the  listed  Intervals  In  response  to  CSs 
presented  at  the  frequency  Indicated  by  the  first  of  the  two  proportions  (to 
the  left  of  the  slashes),  than  to  the  CSs  presented  at  the  frequency 
Indicated  by  the  second  of  the  two  proportions  (to  the  right  of  the  slashes) 
given  as  the  column  headings.  For  example,  the  first  entry  In  the  fourth 
column  of  the  table  Indicates  that  the  CS+  presented  on  .2  of  the  trials 
elicited  significantly  greater  responses  than  the  CS+  presented  on  .8  of  the 
trials  from  50  to  400  ms  after  CS  onset.  Asterisks  Indicate  a  reversal  of 
this  relation.  That  Is  the  CSs  presented  at  the  frequency  Indicated  by  the 
second  of  the  two  proportions  pair  elicited  greater  responses  than  CSs 
presented  at  the  frequency  Indicated  by  the  first  proportion  of  the  pair. 
(N»number  of  records;  NS-not  significant.) 

Table  5.  SURFACE  MACROPOTENTIALS  OURING  THE  STANOARO  AP  SESSIONS: 

Probability  levels,  F  ratios,  and  statistically  significant  (j><  .05) 
individual  comparisons  derived  from  analyses  of  the  training  stage  X 
stimulus  X  Interval  Interaction  In  the  macropotential  records  of  the  surface 
recordings  during  the  standard  AP  sessions.  The  Intervals  (In  milliseconds) 


ANIMAL  MODEL  FOR  P300 


54 


In  which  the  potentials  were  significantly  altered  by  the  probability 
manipulation  are  listed  for  each  comparison.  See  caption  of  Table  5  for  a 
description  of  the  abbreviations  and  symbols. 

Table  6.  UNIT  ACTIVITY  DURING  THE  NON-STANDARD  AP  SESSIONS:  Probability 
levels,  F  ratios,  and  statistically  significant  (£<  .05)  Individual 
comparisons  derived  from  analyses  of  the  stimulus  X  Interval  Interaction  in 
the  unit  records  during  the  non-standard  AP  sessions.  Data  are  presented 
separately  for  sessions  In  which  the  rare  stimulus  was  the  CS+,  and  for 
sessions  In  which  the  rare  stimulus  was  the  CS-.  The  Intervals  (In 
milliseconds)  In  which  responses  were  significantly  greater  to  the  CS+  than 
to  the  CS-  are  listed  In  the  third  and  sixth  columns.  (#-the  response  to 
the  CS-  exceeded  the  response  to  the  CS+;  N-number  of  records;  NS»not 
significant.) 

Table  7.  MACROPOTENTIALS  OURING  THE  NONSTANDARD  AP  SESSIONS:  Probability 
levels,  F  ratios,  and  statistically  significant  (£<  .05)  Individual 
comparisons  derived  from  analyses  of  the  stimulus  type  by  Interval 
Interaction  In  the  macropotentlal  records  of  the  MDN  and  the  surface 
recordings  during  the  asymptotic  training  stAge  of  the  non-standard  AP 
training  procedure.  Data  are  presented  separately  as  sessions  In  which  the 
rare  CS  was  the  C5+  or  the  CS-.  The  Intervals  (In  milliseconds)  In  which 
responses  were  significantly  greater  to  the  CS+  than  to  the  CS-  are  listed 
(third  column).  (N-number  of  records;  NS-not  significant.) 
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P300  Latency  and  Task  Requirements 

Stephen  Jenkins,  Gabriele  Gratton, 

Michael  G.  H.  Coles,  &  Emanuel  Donchin 

Cognitive  Psychophys iology  Laboratory 
University  of  Illinois 

We  have  argued  elsewhere  (Kutas  et  al,  1977;  McCarthy 
&  Donchin,  1981;  Magliero  et  al,  1984)  that  P3Q0  latency  is 
relatively  independent  of  response-related  processes  and 
that  it  can  be  used  as  a  measure  of  stimulus  evaluation 
t  ime . 


In  the  present  experiment,  we  sought  to  determine  the 
degree  to  which  P300  latency  was  influenced  by  task- 
oriented  decision  processes.  Eight  subjects  performed 
letter-discrimination  and  same-different  judgement  tasks  in 
separate  sessions.  In  both  tasks,  one  of  four  visual 
arrays  ( HHHHH ,  SSHSS,  SSSSS,  and  HHSHH )  was  presented  on 
each  of  1  200  trials.  In  the  letter-discrimination  task, 
subjects  were  required  to  execute  left-  or  right-hand 
responses  as  a  function  of  the  center  letter  in  the  array. 
In  the  same-different  judgment  task,  they  were  required  to 
indicate  whether  all  the  letters  in  the  array  were  the 
same,  or  whether  one  was  different,  by  responding  with 
their  left-  or  right-hands. 

In  the  letter-discrimination  task,  reaction  times  were 
longer  when  the  central  letter  in  the  array  was  different 
from  the  surrounding  letters.  In  the  same-different 
judgment  task,  reaction  times  were  the  same  for  all  arrays. 
However,  P300  latency  was  longer  when  the  central  letter 
was  different  from  the  surrounding  letters  in  both  letter- 
discrimination  and  same-different  judgment  tasks. 

These  results  reveal  that  measures  of  reaction  time 
and  P300  latency  can  be  dissociated  and  support  the  claim 
that  P300  latency  is  independent  of  task-oriented  decision 
processes . 
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NlOO  AND  P300  TUNING  EFFECTS  DURING  AN  ATTENTION  SWITCHING  TASK. 

*  * 

Erik  Sirevaaq  and  Arthur  Kramer  (SPON:  L.  Malmgren). 

Oept.  Psych.,  University  of  Illinois,  Urbana-Champaign,  II  61820. 

The  question  of  central  importance  to  the  study  of  selective  attention 
concerns  how,  given  the  continuous  bombardment  of  information  to  which  our 
central  nervous  system  is  subject,  are  particularly  important  sensations, 
memories  and  representations  selected  for  further  processing?  Dichotic 
listening  tasks  designed  to  simulate  the  "cocktail  party  effect"  --  provide 
convincing  evidence  of  the  ability  of  individuals  to  selectively  attend  to  one 
speaker  and  ignore  a  chorus  of  other  voices.  However,  Lewis  (1970,  JEP,  85, 
225-228)  found  that  if  subjects  repeat  a  random  string  of  words  presented  to 
one  ear,  it  takes  them  longer  if  a  semantically  related  word  is  presented 
simultaneously  to  the  opposite  ear.  This  contextual  interference  represents  a 
failure  of  selective  attention.  Treisman,  Squire,  and  Green  (1974,  Mem,  and 
Cog. ,  2(4) ,  641-646)  found  that  the  effect  of  context  is  restricted  to  items 
in  early  list  positions  which  suggests  that  attentional  selectivity  builds  up 
(becomes  better  tuned)  with  successive  stimulus  presentations. 

Investigations  of  the  electro-physiological  concomitants  of  selective 
attention  have  indicated  that  specific  components  of  the  Event-Related- 
Potential  (ERP)  are  related  to  selection  processes.  A  negative  potential 
peaking  approximately  100  msec  post-stimulus  is  sensitive  to  stimulus  set 
selections  based  on  simple  stimulus  features  (location,  pitch,  color,  etc). 
This  potential  is  presumed  to  reflect  the  overlap  between  the  sensory  NlOO 
component  and  an  attention  related  sustained  negativity  labelled  the 
processing  negativity  (PN).  A  subsequent  component,  the  P300,  is  influenced 
by  a  number  of  response  set  factors  such  as  instructions,  strategies,  and 
complex  as  well  as  simple  stimulus  attributes.  The  P300  has  been  related  to 
the  further  hierarchical  evaluation  of  attended  stimuli. 

Two  separate  sets  of  studies  have  explored  the  nature  of  perceptual  tuning 
effects  upon  components  of  the  ERP.  In  the  auditory  modality,  Donald  and 
Young  (1982,  Exp.  Brain  Res. ,  46,  357-367)  found  that  the  selectivity  of  the 
P300  emerged  irmnediately  with  the  first  few  stimuli  in  a  block  and  did  not 
change  over  succeeding  trials  in  a  sustained  attention  paradigm.  PN  tuning, 
on  the  other  hand,  did  not  emerge  until  after  the  presentation  of  several 
trials  and  resulted  from  a  decrement  in  the  PN  associated  with  the  rejected 
channel  over  time.  In  the  visual  modality,  Hillyard,  Munte,  and  Neville 
(1985,  Atten.  and  Perf.  XI,  Hillsdale:  Erlbaum,  63-84)  have  found  that  the  PN 
selectivity  was  present  for  the  first  stimuli  in  a  sequence  and  remained 


unchanged  with  successive  stimuli  despite  a  general  decline  in  PN  amplitude 
over  time. 

The  purpose  of  the  present  study  was  to  examine  the  buildup  of  auditory 
perceptual  tuning  in  a  paradigm  requiring  subjects  to  switch  attention  to 
different  input  sources  more  rapidly  than  was  required  in  previous  studies. 

Methods 

Fig  1.  Task  Description.  Nine  subjects  (7  male  and  2  female)  participated  in 
the  experiment.  An  arrow  presented  visually  before  each  sequence  of  twelve 
auditory  stimuli  indicated  which  ear  was  to  be  attended  to.  Prior  to  the 
experiment  a  target  frequency  was  designated  for  each  subject.  If  a  tone  was 
presented  to  the  attended  ear  and  was  of  the  target  frequency  a  button  press 
response  was  required.  Subjects  performed  50  blocks  of  these  13  trial 
sequences  followed  by  a  5  min  rest  period.  In  a  single  session,  7  groups  of 
50  blocks  were  performed.  Behavioral  data  (RT  and  accuracy)  and 
electroencephalographic  (EEG)  data  from  three  electrode  positions  (Fz,  Cz,  and 
Pz)  were  recorded  in  each  block.  The  amplitudes  and  latencies  of  a  variety 
of  ERP  components  were  analyzed  as  a  function  of  the  ear,  frequency,  and 
target/non-target  classification  of  the  eliciting  tones.  The  ERP  components 
and  behavioral  data  were  also  examined  as  a  function  of  the  time  point  of  the 
stimulus  following  a  switch  in  attentional  locus  (Time  1,  Time  2,  and  Time  3). 
In  the  subsequent  figures  the  following  convention  is  used  to  designate  the 
four  possible  combinations  of  attended  vs.  ignored  frequency  and  location 
attributes  of  the  tone  stimuli: 

F+  L+  :  Attended  Frequency  in  the  Attended  Location 
F+  L-  :  Attended  Frequency  in  the  Ignored  Location 
F-  L+  :  Ignored  Frequency  in  the  Attended  Location 
F-  L-  :  Ignored  Frequency  in  the  Ignored  Location 


Results 


Fig.  2  Mean  Reaction  Time  as  a  Function  of  Time  Point.  Mean  RT  did  not  vary 
as  a  function  of  time  point  [ F (2,16) =0 .14;  p=0.87). 

Fig.  3  False  Alarm  Rate  as  a  Function  of  Time  Point.  The  number  of  false 
alarms  did  not  vary  as  a  function  of  time  point  [ F (2 , 16)  =  0.90;  p=0 .43.1 

Fig.  4  Miss  Rate  as  a  Function  of  Time  Point.  Significantly  more  misses 
occurred  for  stimuli  at  Time  1  than  for  stimuli  at  either  Time  2  or  Time  3 
(F(2,16)=10.07;  p=0.002|. 

Fig.  5  Sensitivity  as  a  Function  of  Time  Point.  Significantly  lower 
sensitivity  scores  were  associated  with  stimuli  at  Time  1  than  at  Time  3 
[ F (2,16) =4 .91;  p=0.02|. 

Fig.  6  Grand  Average  ERPs  at  Fz  as  a  Function  of  Stimulus  Type.  Prominent 
negativities  occurred  with  a  mean  latency  of  110  msecs.  At  each  of  the  three 
time  points,  more  negativity  was  associated  with  stimuli  of  the  attended 
frequency  F ( 1 , 8) =54. 65 ;  p=.008.  The  ear  of  presentation  had  no  effect 
F( 1,8) =0.15;  p=0 .71. 

Fig.  7  Grand  Average  ERPs  at  Fz  as  a  Function  of  Time  Point.  While  less 
negativity  was  associated  with  tones  of  the  non-target  frequencies  at  later 
points  in  time,  target  frequency  evoked  negativities  remained  unchanged  at  all 
three  time  points  F(2, 16)=6. 25;  p=0.01. 

Fig.  8  PN  Amplitude  as  a  Function  of  Time  Point.  Average  of  the  single 
subject  estimates  of  PN  amplitude  100  msec  post-stimulus  at  Fz  illustrating 
the  increased  separation  between  the  target  and  non-target  frequency  channels. 


.  9  Grand  Average  ERPs  at  Pz  as  a  Function  of  Time  Point.  Large  P300s  were 
associated  only  with  stimuli  matching  on  both  the  target  frequency  and 
location  dimensions  F(l,8)=36.48;  p=0.0Q03.  The  amplitude  of  the  P300  to  these 
tones  increased  as  a  function  of  time  point  F(2,16)=12.18;  p=0.Q006. 

Fig.  10  P300  Amplitude  as  a  Function  of  Time  Point.  Average  of  the  single 
subject  estimates  of  P300  amplitude  at  Pz  illustrating  the  increased 
separation  between  the  tones  matching  both  target  attributes  and  the  other 
tone  categories. 

Conclus ions 

1.  The  behavioral  data  suggest  that  subject  sensitivity  increased  with 
successive  stimulus  presentations  within  a  given  attentional  set.  The  fact 
that  RT  and  false  alarm  rate  were  not  affected  by  time  indicates  that  the 
sensitivity  changes  cannot  be  attributed  to  either  speed-accuracy  tradeoffs  or 
differential  response  biases. 

2.  The  PN  data  are  consistent  with  the  hypothesis  that  increased  perceptual 
tuning  occurred  within  the  12  tone  sequence  and  that  this  was  accomplished  by 
a  reduction  in  the  amplitude  of  the  PN  associated  with  non-target  tone 
frequencies  regardless  of  location.  However,  a  number  of  alternative 
explanations  exist.  Although  extensive  bootstrapping  has  indicated  that  the 
PN  differences  are  probably  not  arti factually  produced  by  differences  in 
refractory  periods  or  number  of  trials  per  average,  it  is  possible  that  the 
sensitivity  of  the  PN  to  frequency  rather  than  ear  is  due  to  such  factors  as 
probability,  the  nature  of  the  dimension  that  switches,  etc.  Further  work  is 
needed  to  clarify  the  precise  nature  of  the  observed  PN  effect. 

3.  The  P300  data  provide  convincing  evidence  of  a  P300  tuning  effect.  In 
contrast  with  the  PN  effect,  P300  tuning  is  accomplished  by  an  increase  in  the 
amplitude  of  P300s  associated  with  target  stimuli. 

4.  The  data  support  the  hypothesis  that  there  is  a  phasic  build-up  of 
attentional  processes  during  tasks  requiring  sustained  attention  when  subjects 
are  required  to  modify  their  attentional  set  after  every  few  trials.  The  data 
are  also  consistent  with  hierarchical  models  of  stimulus  selection. 
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The  present  study  examines  the  development  of  automaticity,  as 
reflected  in  overt  performance  and  in  the  P300  component  of  the  event-related 
brain  potential,  in  healthy  elderly  and  young  adults.  The  subjects  practiced 
consistently  (CM)  and  variably  (VM)  mapped  versions  of  the  Sternberg  memory 
search  task.  Four  elderly  (mean  age  =  74)  and  two  young  (mean  age  =  22)  adults 
performed  the  task  for  four  consecutive  days;  performance  from  the  first  and  last 
days  was  compared. 

Reaction  time  (RT)  increased  with  memory  load  for  both  groups,  although 
the  effect  was  greater  for  the  elderly  subjects.  The  effect  of  memory  load  was 
greater  under  VM  conditions  than  under  CM  conditions  for  young  subjects,  but  for 
elderly  subjects  the  effect  did  not  differ  between  conditions.  Overall,  RT 
performance  improved  over  the  four  sessions,  but  RTs  decreased  more  for  young 
subjects  than  for  the  older  subjects.  Furthermore,  the  effect  of  memory  load 
decreased  with  practice  only  for  the  young  subjects.  Taken  together,  these 
results  suggest  that  the  young  subjects  developed  automatic  processing  more 
rapidly  than  the  elderly  subjects. 

An  examination  of  detection  sensitivity  revealed  that,  under  CM 
conditions.  A'  was  very  high,  and  was  unaffected  by  both  memory  load  and  training 
for  young  subjects.  For  elderly  subjects,  however.  A1  decreased  with  memory  load 
early  in  training;  after  consistent  practice,  A‘  was  relatively  unaffected  by 
memory  load.  Further,  by  session  4,  detection  sensitivity  was  higher  for  elderly 
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subjects  than  for  young  subjects. 

In  VM  training.  A'  decreased  as  a  function  of  memory  load  for  all 
subjects  and  all  conditions,  but  was  lower  for  the  elderly  than  for  the  young 
subjects.  By  session  4,  A*  was  less  affected  by  memory  load  for  the  young  than 
for  the  elderly  subjects. 

Examination  of  B“,  a  measure  of  response  criterion,  revealed  that 
younger  subjects'  performance  became  less  conservative  with  CM  practice.  Elderly 
subjects,  however,  did  not  decrease  their  response  criterion  with  such  practice, 
although  A*  increased  significantly.  With  VM  training,  young  subjects  did  not 
change  their  response  criterion,  whereas  elderly  subjects  became  more 
conservative.  These  results  suggest  that  elderly  subjects  did  not  vary  their 
response  criterion  on  the  basis  of  improved  sensitivity.  It  has  been  previously 
suggested  that  such  a  conservative  strategy  decreases  the  rate  at  which 
automaticity  develops  (Schneider  &  Fisk,  1982). 

In  CM  conditions,  the  effects  of  memory  load  on  P300  latency  were 
equivalent  for  young  and  elderly  subjects  early  in  training.  Following 
consistent  practice,  however,  P300  latency  was  insensitive  to  memory  load  for 
young  subjects,  but  remained  affected  by  memory  load  for  elderly  subjects.  In 
addition,  practice  in  CM  conditions  resulted  in  a  greater  overall  reduction  in 
P300  latency  for  the  young.  These  results  are  consistent  with  the  pattern  of  RT 
results  and  suggest  that  the  stimulus  evaluation  process  became  automated  more 
rapidly  for  the  young. 

In  contrast,  °300  latency  increased  as  a  function  of  memory  load  in  VM 
conditions  for  young  subjects  across  the  four  sessions.  For  the  elderly  subjects, 
however,  P300  latency  was  insensitive  to  memory  load  throughout  the  experiment. 
Moreover,  P300  latency  for  older  subjects  increased  with  training.  These  results 
are  paradoxical,  given  the  pattern  of  RT  results.  Similar  findings  of  P300  slope 
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diminishing  with  age  have  been  reported  by  Pfefferbaum  (1980)  and  Strayer, 
Wickens,  and  Braune  (1985).  One  possible  explanation  for  these  results  is  that 
P300  was  smaller  and  more  variable  for  the  elderly  subjects.  Indeed,  P300 
latency  distributions  for  memory  load  2  and  memory  load  4  were  rectangular  and 
overlapping  for  the  elderly.  Given  the  reliability  of  our  measure  of  P300 
latency  (Gratton,  1984)  these  results  suggest  that  the  latency  of  the  process 
manifested  by  P300  was  more  variable  In  the  elderly. 

The  flat  slopes  observed  following  CM  practice  in  the  young  should  be 
contrasted  with  the  flat  slopes  observed  in  the  elderly  In  the  VM  conditions. 

The  distributions  of  the  young  subjects  were  always  Gaussian,  but  overlapped 
following  consistent  practice,  yielding  a  flat  slope.  The  distributions  of  the 
elderly,  in  contrast,  were  rectangular,  and  thus  the  overlap  of  these 
distributions--and  the  resulting  flat  slope--was  due  to  the  high  variability  in 
the  P300s  of  the  elderly. 

RT  distributions  were  analogous  to  the  P300  latency  distributions: 
following  practice  in  any  condition,  the  variance  of  the  distribution  decreased 
for  all  subjects.  This  effect  was  more  pronounced  in  CM  conditions. 

Distributions  for  the  elderly,  however,  were  more  variable  than  for  the  young 
subjects,  and  the  effect  of  practice  on  the  variance  was  less  for  the  elderly. 
These  results  further  support  the  notion  tl.  t  the  P300  distributions  were 
rectangular  due  to  variability  in  stimulus  evaluation  time. 

In  sum,  it  appears  that  the  performance  of  the  elderly  was 
characterized  by  conservative  response  strategies  and  by  highly  variable  stimulus 
evaluation  and  response  processes.  For  the  elderly  subjects,  performance 
improved  following  CM  practice;  nevertheless,  they  failed  to  attain  automatic 
processing  over  5000  trials.  In  contrast  to  the  young  subjects.  It  remains  to  be 
determined  if  automaticity  would  develop  in  the  elderly  with  additional  training. 
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The  present  study  examines  the  attentional  requirements  of  automatic  and 
controlled  processing.  The  amplitude  of  the  P300  component  of  the  ERP  was  used 
as  a  metric  of  the  attentional  resources  invested  in  a  pair  of  tasks.  Subjects 
performed  two  tasks  (a  Sternberg  Memory  Search  task  and  a  Recognition  Running 
Memory  task)  both  separately  and  together.  Two  stimulus-response  mapping 
conditions  were  employed:  consistent  mapping  (CM)  and  varied  mapping  (VM). 
Processing  priority  was  manipulated  between  the  two  tasks  by  instructions. 
Subjects  received  extensive  training  (>24,000  trials)  prior  to  the  experiment. 

It  was  predicted  that  if  automatic  processing  demands  attentional 
resources,  then  CM  events  should  elicit  large  P300s.  However,  since  the 
resources  utilized  during  the  automatic  attention  response  are  assumed  to  be 
allocated  in  an  all-or-none  fashion  it  was  hypothesized  that  the  amplitude  of 
the  P300's  elicited  in  the  CM  conditions  would  not  tradeoff  with  priority  and 
should  not  show  a  dual  task  decrement.  In  contrast,  if  automatic  processing 
does  not  utilize  or  demand  attentional  resources,  there  should  be  little  or  no 
P300  activity  elicited  by  CM  events.  However,  subjects  may  allocate  spare 
capacity  to  the  automatic  processing  even  though  unnecessary.  In  this  case, 
P300  amplitude  should  vary  as  a  function  of  processing  priority.  As  the 
concurrent  task  priority  increases,  the  spare  capacity  remaining  to  be 
allocated  to  automatic  processing  should  diminish,  resulting  in  a  graded  effect 


on  P300  amplitude. 


Met  hods 


Five  subjects  practiced  two  tasks,  both  separately  and  together  for  twelve 
two  and  one-half  hour  sessions  prior  to  performing  in  the  experimental 
conditions.  This  amount  of  practice  was  necessary  to  ensure  that  the  subjects 
had  developed  automaticity  in  the  CM  conditions.  The  tasks  included  a 
Ster  nerg  Memory  Search  paradigm  and  a  Recognition  Running  Memory  task.  The 
Sternberg  paradigm  was  run  with  two  memory  load  conditions  (set  size  1  and  4) 
and  two  response  mapping  (CM  and  VM)  conditions.  The  Recognition  Running 
Memory  task  was  performed  by  comparing  the  item  on  trial  n  to  the  item  on  trial 
n-2.  Subjects  made  a  CRT  response  on  each  trial  for  each  of  the  tasks.  Task 
priority  was  also  manipulated  in  the  dual  task  conditions  such  that  subjects 
either  concentrated  on  one  task,  gave  equal  priority  to  both  tasks,  or 
maximized  their  performance  on  the  other  task  (priority  conditions:  100/0, 
90/10,  50/50,  10/90,  0/100,  where  the  first  number  refers  to  the  priority  of 
the  recognition  running  memory  task  and  the  second  refers  to  the  Sternberg 


ERP  Recording 


EEC  was  recorded  from  three  midline  sites  (Fz,  Cz,  and  Pz,  according  to 
the  International  10-20  system)  and  referred  to  linked  mastoids.  Two  ground 
electrodes  were  positioned  on  the  left  side  of  the  forehead.  EOG  electrodes 
were  placed  above  and  below  the  right  eye.  Electrode  impedances  did  not  exceed 
10  Kohms. 

The  EEC  and  EOG  were  amplified  with  Van  Gogh  model  50000  amplifiers  (time 
constant  10  sec  and  upper  half  amplitude  of  35  Hz).  Both  EEG  and  EOG  were 
sampled  for  1300  msec,  beginning  100  msec  prior  to  stimulus  onset.  The  data 
were  digitized  every  10  msec.  The  ERP's  were  digitally  filtered  off-line  ( — 3Db 
at  8.8  Hz,  0  Db  at  29  Hz)  prior  to  statistical  analysis.  EOG  artifacts  were 
also  corrected  off-line  (see  Gratton,  Coles  and  Donchin,  1983). 
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Figure  1.  A  schematic  representation  of  the  temporal  structure  of  the 
Sternberg  paradigm  (top),  the  recognition  running  memory  task  (middle),  and  the 
combination  of  the  two  tasks  in  the  dual  task  conditions  (bottom). 


F igur e  _2.  Single  and  dual  task  Sternberg  and  Recognition  Running  Memory  RT's 
for  correct  responses.  On  the  left  side  of  the  figure  the  RT's  are  plotted  for 
both  tasks  in  each  of  the  priority  conditions.  The  right  side  of  the  figure 
displays  the  Performance  Operating  Characteristics  (POC)  for  both  CM  and  VM 
conditions.  Least  squares  polynomial  regression  lines  are  fitted  to  the  data 
for  the  CM  and  VM  set  size  1  and  4  conditions.  Several  effects  are  noteworthy. 
First,  the  RT's  obtained  in  the  CM  conditions  indicated  that  we  were  successful 
in  training  the  subjects  to  automaticity;  the  RT's  were  unaffected  by  memory 
load  or  dual  task  demands  (e.g.,  all  points  in  the  upper  right  hand  portion  of 
the  POC).  Second,  even  though  subjects  received  the  same  amount  of  practice  in 
the  VM  condition  as  they  did  in  the  CM  condition  they  did  not  develop 
automaticity.  RT's  increased  as  a  function  of  memory  load  and  showed  large 
dual  task  tradeoffs  as  a  function  of  processing  priority  for  the  larger  memory 


load  conditions. 


F igure  _J.  Single  and  dual  cask  Sternberg  and  Recognition  Running  Memory  A's 
for  correct  responses  (A'  is  a  nonparametric  measure  of  senstivlity,  d').  On 
the  left  side  of  che  figure  the  A's  are  plotted  for  both  tasks  in  each  of  the 
priority  conditions.  The  right  side  of  the  figure  displays  the  POC's  for  both 
CM  and  VM  conditions.  The  pattern  of  results  obtained  for  the  A'  measure  are 
consistent  with  that  obtained  for  RT.  CM  performance  is  unaffected  by  memory 
load  or  dual  task  demands  while  A'  decreases  as  a  function  of  memory  load  in 
the  VM  condition.  A'  also  varied  as  a  function  of  dual  task  priority  in  the 
set  size  4  VM  conditions. 


Figure  4.  Single  and  dual  task  Sternberg  and  Recognition  Running  memory  Pz 
overplots  for  both  CM  and  VM  conditions.  Perusal  of  the  grand  averages  reveals 
the  classic  N200/P300  complex.  It  is  evident  from  the  waveforms  that  the  P300 
is  less  variable  in  the  CM  Sternberg  conditions  than  it  is  in  the  VM 
conditions.  This  observation  is  consistent  with  the  reduced  variance  in  both 
P300  latency  and  RT  in  the  CM  conditions.  It  is  also  clear  from  the  waveforms 
that  the  priority  effect  shown  in  the  RT  and  A'  POC's  is  obtained  for  P300 
amplitude.  The  amplitude  of  the  P300  varies  with  processing  priority  in  the  VM 
but  not  in  the  CM  conditions. 


Grand  average  P300  amplitudes  at  Pz  for  single  and  dual  task 


F igure  _5 . 

Sternberg  and  Recognition  Running  Memory  tasks.  The  amplitude  and  latency  of 
the  P300's  were  obtained  from  a  single  trial  cross-correlation  procedure.  On 
the  left  side  of  the  figure  the  P300's  are  plotted  for  both  tasks  in  each  of 
the  priority  conditions.  The  right  side  of  the  figure  displays  the  POC's  for 
both  CM  and  VM  conditions.  The  pattern  of  results  obtained  cor  the  P300 
measures  are  consistent  with  the  grand  average  waveforms.  In  CM  conditions, 
large  P300s  were  elicited  by  all  events  and  P300  amplitude  was  uninfluenced  by 
processing  priority.  Furthermore,  dual  task  P300  amplitude  was  equivalent  to 
single  task  P300  amplitude.  In  VM  conditions,  P300  amplitude  varied  as  a 
function  of  processing  priority.  The  greater  the  attention  allocated  to  the 
task,  the  larger  the  P300s  elicited  by  these  events.  Reciprocity  was  found 
between  the  two  tasks  under  VM  load  4  conditions. 
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F igure  _6.  Grand  average  P300  latencies  at  Pz  for  single  and  dual  task 
Sternberg  and  Recognition  Running  Memory  tasks.  On  the  left  side  of  the  figure 
the  P300's  are  plotted  for  both  tasks  in  each  of  the  priority  conditions.  The 
right  side  of  the  figure  displays  the  POC's  for  both  CM  and  VM  conditions.  The 
pattern  of  results  obtained  for  the  P300  latency  measure  mimicked  the  RT 
findings.  P300  latency  increased  as  a  function  of  set  size  and  processing 
priority  in  the  VM  conditions.  In  the  CM  conditions,  P300  latency  was 
insensitive  to  memory  load  and  dual  task  priority  instructions.  Given  chat 
P300  latency  is  sensitive  to  stimulus  evaluation  processes,  but  not  response 
selection  processes,  these  results  suggest  that  stimulus  encoding  and  memory 
comparison  operations  can  be  performed  in  parallel  with  the  information 
processing  activities  of  concurrent  tasks  after  substantial  practice  in  a 
consistently  mapped  environment. 


RT/P300  latency  ratios  for  single  and  dual  task  Sternberg  and 


Figure  7_. 

Recognition  Running  Memory  tasks.  On  the  left  side  of  the  figure  the  ratios 
are  plotted  for  both  tasks  in  each  of  the  priority  conditions.  The  right  side 
of  the  figure  displays  the  POC's  for  both  CM  and  VM  conditions.  A  ratio  greater 
than  1  indicates  that  the  peak  P300  latency  occurred  prior  to  the  overt 
response  while  a  ratio  of  less  than  1  indicates  that  the  P300  followed  the 
overt  response.  In  CM  conditions,  P30C  latency  was  relatively  constant  and  did 
not  vary  as  a  function  of  priority.  In  addition,  reaction  time  preceded  P300 
latency  in  all  CM  conditions  and  the  RT/P300  ratio  did  not  vary  as  a  function 
of  priority.  In  VM  conditions,  P300  latency  increased  with  memory  load  and 
varied  as  a  function  of  priority.  Furthermore,  P300  latency  preceded  RT,  and 
the  RT/P300  ratio  varied  as  a  function  of  priority.  Thus,  as  attention  was 
withdrawn  from  the  task,  the  RT/P300  ratio  increased  in  the  VM  conditions. 


Conclusions 


Taken  together,  these  results  support  the  hypothesis  that  attentional 
resources  are  allocated  to  automatic  processing.  When  a  CM  target  is  presented, 
attention  is  automatically  allocated  to  the  task.  However,  consistent  with  the 
hypothesis  that  the  automatic  attention  response  occurs  in  an  all-or-none 
fashion,  P300  amplitude  did  not  vary  as  a  function  of  priority  in  the  CM 
conditions.  On  the  other  hand,  the  tradeoff  in  resources  between  the  two  tasks 
in  the  VM  conditions  was  reflected  in  the  P300's.  The  amplitude  of  the  P300 
varied  in  a  reciprocal  fashion  with  large  P300's  being  elicited  in  the  task 
that  was  emphasized. 

A  second  purpose  of  the  experiment  was  to  examine  differences  in  automatic 
and  controlled  processing  from  a  chronometric  perspective.  Earlier  research 
(e.g.,  van  Dellen  et  al. ,  1984;  Strayer  and  Kramer,  1986)  suggested  that 
perceptual  processing  becomes  more  efficient  during  the  development  of 
automat ici ty.  If  this  proposition  is  correct,  then  P300  latency  should  be 

uninfluenced  by  changes  in  priority  under  practiced  CM  conditions  and  the 
relative  timing  of  reaction  time  and  P300  latency  should  not  change  as  a 
function  of  priority.  These  predictions  were  supported  by  the  RT/P300  ratio 
data.  The  results  suggest  that  an  efficient  information  extraction  process 
emerges  following  consistent  practice.  This  may  be  likened  to  the  tuning  of  a 
perceptual  filter  and  may  correspond  to  the  "pop  out"  effect,  where  CM  targets 
appear  to  jump  out  of  the  display. 
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Adult  Age  Differences  in  the  Development  of  Automaticity: 

A  Psychophysiological  Assessment 

David  L.  Strayer,  William  J.  Gehring,  Arthur  F.  Kramer,  and  Gregory  A.  Miller 
University  of  Illinois  at  Urbana-Champaign 
Abstract 

This  study  provides  a  fine-grained  analysis  of  the  age-related 
differences  in  the  acquisition  of  automatic  processing.  Following 
consistent  mapping  (CM)  training,  young  subjects  develop  automatic 
processing.  While  elderly  subjects  improve  under  CM  training 
conditions,  this  is  not  due  to  the  development  of  automatic 
processing,  but  to  a  reduction  in  response-related  processing.  Older 
subjects  adopted  a  more  conservative  response  bias  than  the  young, 
which  may  interfere  with  the  development  of  automaticity. 

One  of  the  most  ubiquitous  age-related  changes  in  performance  is  the 
slowing  of  behavior  with  senescence.  However,  automatic  skills  which  are 
acquired  in  young  adulthood  appear  to  be  resistant  to  these  decrements  (Casey  et 
al.,  1987).  Several  investigators  have  sought  to  determine  if  the  elderly  can 
develop  automatic  processing  skills  and  thereby  eliminate  the  age-related 
decrements  in  performance.  Studies  which  have  addressed  this  issue  have  found 
that  the  elderly  generally  do  not  attain  the  same  level  of  asymptotic 
performance  in  tasks  that  require  automatic  processing,  nor  do  they  improve  at 
the  same  rate  as  young  adults  (e.g..  Madden  &  Nebes,  1980;  Nobel  et  al . ,  1964; 
Plude  et  al . ,  1983) . 

The  purpose  of  this  study  is  to  provide  a  fine-grained  analysis  of  the  age- 
related  differences  in  the  development  of  automaticity.  We  will  employ  the 
converging  methodologies  of  additive  factors  logic  and  the  P300  component  of  the 
Event-Related  Brain  Potential  (ERP)  to  localize  the  changes  in  information 
processing  with  practice  and  S-R  mapping.  Several  lines  of  converging  evidence 
suggest  that  the  latency  of  the  P300  component  is  sensitive  to  stimulus 
evaluation  professes,  but  relatively  insensitive  to  response-related  processes 
(e.g.,  Magliero  et  al.,  1984).  Since  a  major  portion  of  the  slowing  with  age 
has  been  localized  to  response-related  processing  (Strayer  et  al.  1987),  we  seek 
to  determine  the  extent  to  which  automatic  processing  can  bypass  this 
information  processing  bottleneck.  We  further  ask  whether  the  stimulus 
evaluation  processes  can  be  automated  in  the  elderly. 

Method 


Subjects 

Eight  young  (mean  age  =  20.6,  sd=1.5  )  and  eight  elderly  (mean  age  =  73.1, 
sd=6.7)  subjects  participated  in  the  experiment.  All  subjects  had  normal  or 
corrected-to-normal  vision  and  were  in  good  health.  Subjects  were  paid  for 
their  participation. 

Procedure 


Subjects  performed  a  variant  of  the  Sternberg  memory  search  task  (1966) 
under  both  consistent  mapping  (CM)  and  varied  mapping  (VM)  conditions.  Each 
block  of  trials  consisted  of  a  memory  set  presented  for  3  seconds,  followed  by 
30  pro^j  trials.  On  each  trial  two  probe  letters  were  simultaneously  presented 
within  1.5  degrees  of  visual  angle.  Subjects  were  instructed  to  press  one 


button  if  either  of  the  probes  was  a  member  of  the  memory  set  and  press  another 
button  if  neither  probe  was  a  member  of  the  memory  set.  The  experiment  was 
conducted  in  four  sessions  (days),  resulting  in  5760  CM  trials  and  5760  VM 
trials. 

ERP  Record inqs 

In  sessions  one  and  four,  EEG  was  recorded  from  Fz,  Cz,  and  Pz  sites.  EEG 
and  EOG  were  sampled  every  10  msec  for  1300  msec,  beginning  100  msec  prior  to 
stimulus  onset.  EOG  artifacts  were  corrected  off-line.  Single  trial  estimates 
of  P300  latency  were  derived  using  a  peak  picking  algorithm  at  the  Pz  electrode 
within  a  window  from  300  to  1150  msec.  Average  ERPs  were  generated  for  each 
experimental  condition.  Each  subject  contributed  a  maximum  of  360  trials  to 
each  average. 

Results 

Our  analyses  will  focus  on  the  first  and  last  sessions  of  practice.  The 
design  is  a  2  (age:  young  vs  old)  X  2  (session:  first  vs  last)  X  2  (mapping:  CM 
vs  VM)  X  2  (set  size:  2  vs  4)  X  2  (response  type:  target  vs  dlstractor) 
split-plot  factorial.  The  results  will  be  hierarchically  organized.  Within  each 
dependent  measure  we  will  examine  the  effects  of  practice  and  S-R  mapping, 
followed  by  an  analysis  of  age-related  differences  in  these  effects.  For  all 
analyses,  a  significance  level  of  .05  is  adopted. 

Reaction  T ime 

Table  1  presents  mean  reaction  time  to  targets  as  a  function  of  age, 
session,  mapping,  and  set  size.  The  linear  regression  slopes  for  memory  set 
size  are  also  presented. 


S-R  Mapping: 
Memory  Set  Size: 

2 

Table 

CM 

4 

1 

Slope 

2 

VM 

4 

Slope 

Young 

Session  1 

469 

533 

31.9 

494 

630 

68.2 

Session  4 

414 

434 

9.9 

454 

572 

58.8 

Old 

Session  1 

652 

737 

42.8 

685 

804 

59.2 

Session  4 

558 

597 

19.2 

619 

752 

66.6 

Reaction  time  decreased  with  practice  F_(l , 14) =19 . 1,  was  shorter  for  CM  than 
for  VM  conditions  F( 1, 14)=114.9,  and  increased  as  a  function  of  set  size  at  a 
greater  rate  in  VM  conditions  than  in  CM  conditions  £(1,14)=61.0.  The 
difference  between  CM  and  VM  conditions  increased  from  session  1  to  session  4, 

F( 1, 14)=17.0,  as  did  the  CM  and  VM  difference  as  a  function  of  memory  set  size, 
F( 1 , 14)  =  10. 3 .  In  addition,  reaction  time  was  shorter  for  targets  than  for 
distractors,  F(l,14)=113.9. 

Elderly  responded  more  slowly  than  young,  F(1 ,14)  =  19.5.  Both  groups 
improved  with  practice.  The  absolute  level  of  reduction  in  the  memory  slope  for 
CM  conditions  was  equivalent;  however,  the  ratio  of  CM  slopes  in  session  1  vs 
session  4  revealed  a  greater  proportional  reduction  for  the  young  (3.22)  than 
for  the  elderly  (2.23).  Furthermore,  the  ratio  of  VM  to  CM  slopes  in  session  4 
was  larger  for  the  young  (5.94)  than  for  the  elderly  (3.47).  This  reflects 
differences  in  asymptotic  levels  of  performance.  The  session  4  CM  slopes  of  the 
elderly  were  twice  the  slopes  of  the  young. 


Detection  Sensitivity 


The  non-parametric  measure  A'  was  adopted  to  determine  changes  in  detection 
sensitivity.  A'  ranges  from  .50  for  chance  accuracy  to  1.0  for  perfect 
detection  accuracy.  A'  was  quite  high  throughout  the  experiment,  ranging  from 
.86  to  .99.  A'  decreased  as  a  function  of  set  size,  £(1 ,14)=94.9,  and  was 
larger  for  CM  than  VM  conditions,  £(1,14)=100.0;  however,  the  decrease  in  A'  as 
a  function  of  set  size  was  larger  for  VM  than  CM  conditions,  £(1,14)=41.7.  A1 
also  increased  from  session  1  to  session  4,  £(1,14)=30.3,  and'this  was  more 
evident  for  VM  than  CM  conditions,  £(1,14)=4.45.  This  interaction  is  probably 
due  to  a  ceiling  effect  for  CM  conditions,  since  initial  A1  values  were  quite 
high. 


A'  increased  from  session  1  to  session  4  more  for  the  elderly  than  for  the 
young,  F(l,14)=7.7,  reflecting  poorer  detection  sensitivity  for  the  elderly 
early  in  training  and  greater  detection  sensitivity  for  the  elderly  following 
training.  Furthermore,  elderly  were  more  affected  by  S-R  mapping,  £(1,14)=6.5, 
and  by  set  size,  £(1,14) =7 . 3 ,  than  the  young.  These  latter  effects  are  heavily 
influenced  by  VM  performance  in  session  1,  where  memory  set  size  produced  its 
greatest  effect  on  the  elderly. 

Response  Bias 

Ihe  non-parametric  measure  B ‘ '  was  used  to  assess  subjects'  response  bias. 
Larger  values  of  81'  reflect  a  more  conservative  response  bias.  In  session  1, 
subjects  adopted  a  more  conservative  response  bias  for  CM  than  VM  conditions; 
however,  in  session  4  the  subjects  responded  more  conservatively  in  VM 
conditions,  £( 1 , 14)= 15 . 5 .  In  addition,  B‘ 1  decreased  with  increasing  memory  set 
size  for  VM  conditions,  but  not  for  CM  conditions,  £(1,14)=12.9,  indicating  that 
subjects  adopted  a  more  risky  response  strategy  to  compensate  for  the  more 
difficult  VM  condition. 

Elderly  responded  more  conservatively  than  young,  £(1,14)=6.2.  Furthermore, 
young  subjects  became  less  conservative  with  practice,  but  elderly  subjects 
became  more  conservative  following  practice,  £(1,14)=4.53.  Elderly  also  tended 
to  respond  more  conservatively  in  CM  conditions  than  VM  conditions,  while  young 
subjects  tended  to  respond  more  conservatively  in  VM  conditions  than  in  CM 
conditions,  £(1,14)=6.6.  This  was  coupled  with  a  tendency  of  the  elderly  to 
become  more  conservative  as  memory  set  size  increased  in  CM  conditions, 

£(1,14) =9 . 2 . 


P300  Latency 

Table  2  presents  mean  P300  latency  to  targets  as  a  function  of  age, 
session,  mapping,  and  set  size.  The  linear  regression  slopes  for  memory  set 
size  are  also  presented. 

Table  2 


S-R  Mapping: 
Memory  Set  Size: 

2 

CM 

4 

Slope 

2 

VM 

4 

Slope 

Young 

Session  1 

590 

654 

32.1 

614 

697 

41.5 

Session  4 

610 

621 

5.6 

611 

707 

47.8 

Old 

Session  l 

708 

738 

14.7 

708 

754 

23.0 

Session  4 

631 

654 

11.6 

678 

738 

29.6 

P300  latency  increased  with  set  size,  £(1,14)=80.1;  however,  the  effect  was 
greater  for  VM  than  CM  conditions,  £(1,14)=13.2,  particularly  in  session  4, 
£(1,14)=6.1.  P300  latency  was  shorter  for  CM  than  VM  conditions,  £(1,14)=23.8, 

and  this  was  more  pronounced  in  session  4,  F(l,14)=16. 1.  In  addition,  P300 
latency  was  shorter  for  targets  than  distractors,  £(1,14)=93.7. 

P300  latency  was  shorter  for  young  than  elderly,  £(1,14)=8.5.  However,  the 
effect  of  memory  set  size  was  greater  for  the  young  than  for  the  elderly, 

F( 1 , 14)=8. 3.  Furthermore,  there  were  differential  effects  of  mapping,  response 
type,  and  session  for  young  and  elderly,  £(1,14)=8.5,  and  mapping  and  set  size, 
£(1,14)=5.0.  For  the  young,  the  effect  of  memory  set  size  produced  equivalent 
effects  on  P300  latency  for  CM  and  VM  conditions  in  session  1,  but  in  session  4 
the  memory  set  size  effect  was  substantially  reduced  in  the  CM  condition.  In 
contrast,  the  effects  of  memory  set  size  were  relatively  constant  across  session 
for  both  CM  and  VM  conditions  for  the  elderly.  The  improvement  with  practice 
can  be  illustrated  by  comparing  the  ratio  of  the  CM  slopes  in  session  1  and  4 
for  the  two  age  groups.  The  CM  sessionl/session4  ratio  for  young  was  5.73  and 
1.27  for  elderly. 

RT/P300  Ratio 

A  single-trial  ratio  of  RT  to  P300  latency  was  calculated  to  determine  the 
proportion  of  stimulus  evaluation  accomplished  at  the  moment  of  response  in  each 
condition.  A  ratio  of  1.0  indicates  that  the  RT  response  and  peak  of  the  P300 
co-occurred.  Ratios  less  than  1.0  indicate  that  the  response  preceeded  P300 
latency  and  ratios  greater  than  1.0  indicate  that  the  response  followed  P300 
latency.  Previous  research  suggested  that  a  large  portion  of  the  age-related 
slowing  is  due  to  response-related  processing  (Strayer  et  al.,  1987).  This 
analysis  was  conducted  to  determine  if  post-stimulus  evaluation  processing  was 
reduced  following  consistent  practice  in  the  elderly. 

The  RT/P300  ratio  decreased  with  practice,  £(1,14)=18.8,  and  increased  with 
set  size,  F( 1, 14)=53.2.  This  latter  effect  was  more  evident  in  session  1, 
F(l,14)=6.7.  Further,  the  RT/P300  ratio  was  larger  for  VM  than  CM  conditions, 

F( 1, 14)=41 .0,  and  this  was  more  pronounced  in  set  size  4,  F(l,14)=20.5. 

The  RT/P300  ratio  was  larger  for  elderly  than  young,  F(l,14)=10.1.  The 
average  RT/P300  ratio  was  0.89  for  the  young  and  1.12  for  the  elderly.  This 
implies  that  elderly  engaged  in  more  post-stimulus  evaluation  processing  prior 
to  their  response  than  the  young.  Age  did  not  enter  into  any  interactions, 
suggesting  that  the  age  differences  in  post-stimulus  evaluation  were  not 
modified  by  practice  or  S-R  mapping. 

Discussion 

Both  young  and  elderly  improved  with  practice.  This  improvement  was 
observed  as  decreases  in  reaction  time  and  increases  in  response  accuracy. 
However,  elderly  responded  more  slowly  and  more  accurately  than  young  following 
practice.  This  suggests  that  the  elderly  were  trading  response  speed  for 
accuracy,  adopting  a  more  conservative  response  bias  than  the  young. 

Reductions  in  the  reaction  time  memory  set  size  slope  were  apparent  for 
both  age  groups,  but  the  young  Improved  more  rapidly  and  achieved  a  lower 
asymptote.  Reductions  in  the  P300  latency  memory  set  size  slopes  were  apparent 
only  for  the  young.  This  suggests  that  the  stimulus  evaluation  processes  become 
automated  for  the  young,  but  not  for  the  elderly.  The  improvement  in  reaction 
time  performance  for  the  elderly  was  attributed  to  a  reduction  in 
response-related  processing  which  was  apparent  as  a  trimming  of  long  latency 


responses.  This  pattern  of  data  suggests  that  the  improvements  in  performance 
of  the  elderly  are  not  the  result  of  automatic  processing,  but  rather  are  due  to 
a  reduction  in  post-stimulus  evaluation  processing. 


One  possible  interpretation  for  why  the  elderly  do  not  acquire  automatic 
processing  may  be  their  conservative  response  bias.  It  has  been  suggested  that 
such  a  conservative  strategy  interferes  with  the  development  of  automaticity 
(Shiffrin  et  al.,  1984).  One  prediction  from  this  interpretation  is  that  if 
elderly  subjects  adopted  a  less  conservative  response  bias,  then  their 
acquisition  rates  and  asymptotes  should  be  similar  to  young.  Further,  If  young 
subjects  adopt  a  more  conservative  response  bias,  then  the  rate  of  improvement 
and  the  asymptote  should  be  similar  to  that  of  the  elderly. 

Thus  age  differences  in  the  development  of  automaticity  appear  to  be  the 
result  of  strategic  changes  in  information  processing. 
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